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Human preproinsuiin-Iike growth factor I. 



(jjj The invention provides DNA sequences encoding a 
human preproinsulin-like growth factor I protein and novel 
extension peptides. A novel preproinsulin-like growth factor I 
protein and novel extension peptides are also provided. The 
present invention further provides a human IGF-I gene which 
has been sequenced and which encodes at least two pre- 
proinsulin-like growth factor-l proteins. Various genes and 
DNA sequences useful in producing essentially pure mature 
IGF-I, preproinsulin-like growth factor I proteins and IGF-I 
gene related proteins are also provided. 
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HUMAN PREPROINSULIN-LIKE GROWTH FACTOR I 

The present invention relates to a novel 
insulin-like growth factor precursor protein and 
novel human preproinsulin-like growth factor I genes 
5 and DNA sequences. 

Background of the Invention 

Insulin-like growth factors (IGF's) have 
been isolated from various animal species, and are 
believed to be active growth promoting molecules that 
10 mediate the anabolic effects of such hormones as 
growth hormone and placental lactogen. As such, 
IGF's should be useful in the treatment and/or poten- 
tiation of various growth conditions and/or wound 
healing . 

I 5 The designation "insulin-like growth fac- 

tors" was chosen to express the insulin-like effects 
and insulin-like structure of these peptides. IGF's 
share nearly 50% amino acid homology with insulin and 
in thr 

ee dimensional structure resemble proinsulin. 

2 0 Furthermore, by three dimensional modeling, the 

structures of IGF's are similar to proinsulin being a 
single chain peptide, cross-linked by three disulfide 
bridges and consisting of a B-chain-like amino- 
terminal part (B domain), a connecting peptide (C 
25 domain), and an A-chain-like part (A domain). In 

addition, a carboxyl-terminal extension not found in 
proinsulin is present (D domain). Recent studies also 
report the presence of yet another carboxyl-terminal 
extension not found in proinsulin which has been 

3 0 given an E domain designation. The E domain peptide 

has thus far been identified in association with rat 
and human IGF-II. Hylka et al . (1985) and Zumstein 
et al. (1985) . 
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weight material may represent an IGF-I precursor. 
The authors also demonstrated that metabolic regula- 
tion of the higher molecular weight rat liver IGF-I 
was similar to rat serum-derived IGF-I. Recently, 
5 Zumstein et al . (1985) isolated a variant pro-form of 
IGF- I I from human serum which they demonstrated to 
contain IGF-I I-like activity in vitro . 

Because of the potential bioactivity and 
utility of high molecular weight, precursor IGF-I 

10 proteins in the treatment and/or potentiation of 

various growth conditions, the amino acid sequence 
and DNA coding sequence of such a precursor protein 
has long been sought. 

Jansen et al . (1933), provided an amino 

15 acid sequence derived from a human IGF-I cDNA clone 
which supports the suggestion of a larger IGF-I 
precursor. See also Netherlands Patent Application 
No. 83 02324, published January 16, 1985. The cDNA 
disclosed by Jansen, however, did not provide suffi- 

2 0 cient DNA sequence information to teach the precise 
translational start of the suggested precursor pro- 
tein. Additionally, Jansen et al . (1983) and Nether- 
lands Patent Application No. 8302324 provide no 
evidence or teaching that the cDNA sequence is or can 

2 5 be expressed (e. g., that the suggested precursor 
protein is produced). 

Indeed, very little is known about IGF-I 
biosynthesis. Preliminary studies suggest that only 
one human IGF-I gene exits per haploid genome. 

30 Ullrich et al . (1984); Brissenden et al . (1984); and 
Tricoli et al . (1984). Studies of IGF-I biosynthesis 
have been hampered by a very low IGF-I content in 
tissue, Vasilopbulou-Sellin and Phillips (1982) and 
because, in contrast to IGF-I I, no cultured cell 

35 lines have been identified which elaborate significant 
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To date, several classes of IGF's have been 
identified. These include IGF- I (Somatomedin C), 
IGF-II, Somatomedin A, and a mixture of peptides 
called multiplication-stimulating activity (MSA). 
5 This heterologous group of peptides exhibit important 
growth-promoting effects in vitro , Daughaday (1977); 
Clemmons and Van Wyk (1981a), and in vivo , van Buul- 
Offers and Van den Brande (1980), Schoenle et al. 
(1982). 

10 Two human IGF's have been characterized. 

These are IGF- I, comprising a 70 amino acid basic 
protein, Rinderknecht and Humbel (1978a), Rubin et al . 
(1982), and IGF-II, comprising a 67 amino acid neutral 
peptide, Rinderknecht and Humbel (1978b); Marguardt 

15 and Todaro (1981). Whereas the complete amino acid 

sequences have only been determined for rat and human 
IGF-I and IGF-II, Humbel, R. E. (1984), a high degree 
of- homology and/or cross-reactivity has been shown by 
radioimmunoassay and/or radioreceptor assay to exist 

20 among IGF-I's and among IGF-I 1 1 s from different 
species. Wilson and Hintz (1982). 

Circulating levels of these peptides appear 
to be under the control of growth hormone to a greater 
or lesser extent with IGF-I being controlled to a 

25 greater extent than IGF-II. IGF-I, for example, 

plays a fundamental role in postnatal mammalian growth 
as a major mediator of growth hormone action. See 
Copeland et al . (1980) and Schoenle et al . (1982). 

Vassilopoulou-Sellin and Phillips (1982) 

30 have estimated, using molecular sieve chromatography, 
that IGF-I activity, assayed both in vitro and in 
vivo , extracted from rat liver has a higher molecular 
weight (approximately 30 kilodaltons) than activity 
extracted from plasma (approximately 8 kilodaltons). 

35 The authors -suggested that the higher molecular 
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quantities of this peptide. Clemmons and Van Wyk 
(1981b); Clemmons and Shaw (1983). Additionally, 
neither the complete human IGF-I gene has been iso- 
lated nor the complete DNA sequence determined. 
5 Preliminary studies by Bell et al. (1985) suggest that 
the human IGF-I gene is at least 3 5 kilobases (kb) in 
length, of which only 210 base pairs (bp) encodes 
mature human IGF-I. The studies by Bell et al . (1985) 
also suggest that the human IGF-I gene contains only 

10 four exons, which together encode a single precursor 
IGF-I protein. The large size (e.g. greater than 35 
kb) and complexity of this gene relative to the mature 
IGF-I coding sequences has made both isolation and 
identification of the complete genomic DNA sequences 

15 extremely difficult. Isolation of a genomic IGF-I 
clone (e.g. IGF-I gene) would greatly facilitate 
studies of IGF-I biosynthesis and provide a means for 
identifying and producing precursor, mature and/or 
intermediate IGF-I species and allelic variants 

2 0 thereof. A genomic clone thus facilitates determining 

which proteins are believed to be active growth 
promoting peptides . 

; : Accdrdingiy , it '.is • an object'; of -this inven- 
tion to .provide a highly purified gene and/or syri-,; 
25 thetic DNA sequences encoding an IGF-I precursor 

protein (preproinsulin-like growth factor-I), and/or 
peptide fragments thereof and useful in making such a 
protein and/or fragments . 

It is another object of this invention to 

3 0 provide processes utilizing such DNA in the production 

of such proteins and peptides. 

Another object of this invention is to 
provide the amino acid sequence } of a novel 
preproinsulin-like growth factor-I and methods using 
35 such a protein or fragments thereof to promote 

desirable growth or functionality of cells in animals.. 
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Summary of the Invention 

The present invention relates to the 
discovery of a novel preproinsulin-like growth 
factor- I- (ppIGF-I) protein, an essentially pure human 
IGF- 1 gene and synthetic DNA sequences which encode 
the novel ppIGF-I protein. 

In one embodiment, the present invention 
provides an essentially pure IGF-I gene encoding at 
least two. ppIGF-I proteins and containing the sequence 
of nucleotides (or their functional equivalents for 
protein expression) as given in FIG. 4. 

In another embodiment, the invention pro- 
vides synthetic DNA sequences encoding a novel 
ppIGF-I protein. One such synthetic DNA sequence 
contains the sequence of nucleotides (or their func- 
tional equivalents for protein expression) as given in 
FIG. 2. 

In still another embodiment, the present 
invention provides an essentially pure DNA sequence 
encoding a carboxyl-terminal extension of the novel 
ppIGF-I protein. One such essentially pure DNA 
sequence contains the following sequence of 
nucleotides (or their functional equivalents for 
peptide expression) : 

5 ' TATC AG CCCC CAT C T AC C AAC AAGAA C ACGAAGT CT C AGAGAAGG AAAGGTT 
GGCCAAAGACACATCCAGGAGGGGAACAGAAGGAGGGGACAGAAGCAAGTCTGC 

AGATCAGAGGAAAGAAGAAAGAGCAGAGGAGGGAGATTGGAAGTAGAATGCTGAA 
TGCAGAGGCAAAAAAGGAAAATGA3 ' 



30 



In yet another embodiment, the invention 
provides a novel ppIGF-I protein having the amino 
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acid sequence as given in FIG* 6. 

In a further embodiment, the present inven- 
tion provides a novel extension peptide having the 
following amino acid sequence: 

5 NH -Tyr Gin Pro Pro Ser Thr Asn LysfJ^n^ht Lys Ser Gin Arg Arg * 

li^sfGly Trp Pro Lys Thr His Pro Gly "Siy^Blu Gin Lys Glu Gly Thr 

Glu Ala Ser Leu Gin He Arg Gly Lys Lys Lys Glu Gin Arg Arg Glu 

He Gly Ser Arg Asn Ala Glu Cys Arg Gly Lys Lys Gly Lys - COOH. 

This extension peptide represents a carboxyl- 
10 terminal extension portion of the novel ppIGF-I 
protein. 

In other embodiments, the invention provides 

various methods for promoting growth and/or other 

desirable functions of cells in animals by admini- 
15 stering the novel proteins and/or peptides of this 

invention to animals in amounts sufficient to cause 

such effects. 

In other embodiments, the present invention 

provides methods for producing the essentially pure 
20 proteins of this invention by causing expression of 

the gene and/or synthetic DNA sequences provided 

herein. 

Brief Description of the Drawings 

In the following diagrammatic representa- 
25 tions, amino acid sequences are provided reading from 
the amino terminus (NH 2 ) to the carboxy terminus 
(COOH) of the sequence. The DNA sequences are 
provided in a 5 r to 3 1 orientation. Relevant 
restriction endonuclease sites are also shown. The 
3 0 DNA regions marked as described below are for purposes 
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of diagrammatic representation only and are not drawn 
to scale unless otherwise noted. The numbering of the 
DNA sequences from 5' to 3' and of the amino acid 
sequences from the amino terminus to carboxy terminus 
is for diagrammatic purposes only. 



FIG. 1 depicts a map of a human IGF-1B cDNA clone. 

The horizontal directional arrows denote 
the approach to DNA sequencing of the cDNA: 
the open box denotes the 585 base open 

10 reading frame and wherein the cross hatched 

box denotes the DNA sequence encoding the 
mature human IGF- 1 peptide; the thin lines 
denote the 5 1 and 3 f untranslated regions 
of the 1136 nucleotide cDNA. Below the 

15 ma P of the human IGF- IB cDNA are represen- 

tative cDNA clones denoted AIGF-2 and 
AIGF-5. The "A 42 " on the AIGF-5 clone 
denotes a poly A coding sequence consisting 
of 42 adenosine (A) bases. 

20 FIG. 2 depicts the DNA sequence of the IGF-1B cDNA 

clone (1136 nucleotides) and a translation 
of the 585 base open reading frame starting 
at nucleotide 183 (e. g. ppIGF-lB amino acid 
sequence.) Termination codons are denoted 

25 by ### and ***. The broken underlined 

region denotes the nucleotides encoding the 
mature human IGF-I peptide (nucleotide 327 
through nucleotide 536) and attendant amino 
acid sequence of mature human IGF-I. The 

30 solid underlined region denotes the nucleo- 

tides encoding the IGF-1B carboxyl-terminal 
extension peptide and attendant amino acid 
sequence of said extension peptide. 



10 
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FIG- 3(a) depicts the structure and restriction 

map of the human" IGF- I gene. The 5 1 
to 3 f directional arrow indicates the 
5 ( to 3' orientation of the IGF-I gene. 
The thin lines denote introns and 
flanking regions; the ff denotes a 
region of an intron not isolated by 
molecular cloning; the solid boxes (1, 
2, 3, 4, 5) denote coding regions and 
the cross-hatched boxes denote non- 
coding regions; the vertical arrows 
under exons 4 and 5 denote the polya- 
denylation sites. The regions marked 
"Alu" denote hybridization to middle 
15 repetitive DNA of the Alu type. 

Restriction sites are denoted by "B" = 
BamHI, "E" = EcoRI , "H" = Hindlll. 
Below the restriction map are represen- 
tative genomic clones . 

20 

(b) depicts detailed restriction maps of 
IGF-I exons 1 to 5 . The horizontal 
directional arrows denote DNA sequencing 
strategy; the boxes denote exons 
wherein the solid regions denote coding 
regions and the crosshatched regions 
denote noncoding regions. The vertical 
arrows under exons 4 and 5 denote the 
polyadenylation sites. 

30 FIG. 4 depicts the DNA sequence of the human 

IGF-I gene. Exons are in upper case 
and introns and flanking DNA are in 
lower case. translation of the 
ppIGF-I proteins (e.g. amino acid 



25 
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sequence of ppIGF-lA and ppIGF-lB) 
are indicated and carboxyl- terminal 
extensions of the pIGF-lA and pIGF-lB 
are denoted by solid overlines labeled 
1A and IB, respectively. ■ The 70 amino 
acids of mature IGF-I are denoted by a 
broken underline. m frame translation 
termination codons, TGA and TAA, 
preceding the translation initiation 
start-signal codon, ATG, are denoted 
by ***. a putative transcription 
start site is denoted by AA . 
Polyadenylation signals AATAAA and 
AATATA are denoted by a solid overline 
and are followed by poly A addition 
sites, denoted "poly A A", in exons 
4 and 5, respectively. The dotted line 
denotes unseguenced regions the lengths 
of which are marked thereon in kb 
20 (kilobases). 

FIG ' 5 depicts a comparison of the IGF-1A and 

IGF-lB cDNA sequences. The top line 
represents IGF-lB. The bottom line 
represents IGF-1A. The terminal (5' 
and 3 1 ) nucleotides are denoted by 
an "X" . 



15 



25 



FIG. 6 



depicts the amino acid sequence of the 
ppIGF-lB protein. 
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Detailed Description of the In vention 

As used herein, the symbols representing 
amino acids (e.g. Ala for alanine) and nucleotides 
(G,C,A,T) are those conventionally employed. See 
Lehninger (1976). 

As used herein, "gene" refers to a region 
of chromosomal DNA which region contains DNA sequences 
encoding a given protein. Typically, eucaryotic 
genes contain both introns and exons . The term 
"exon" refers to those gene (e.g. DNA) sequences 
which are transcribed into RNA. A given exon can, 
therefore, comprise both protein encoding (coding) 
and non-coding DNA sequences wherein both the coding 
and non-coding DNA sequences are transcribed into RNA 
but only the coding sequences are translated into 
protein. The term "intron" refers to a DNA sequence 
present in a given gene which is neither transcribed 
nor translated and is generally found between exons 
(hence the name "intron"). 

The term "essentially pure," when used 
herein, to describe a protein, peptide, nucleic acid 
(DNA or RNA) sequence, gene and/or molecule, means 
substantially free from proteins, peptides, nucleic 
acid sequences, genes and/or molecules with which the 
described protein, peptide, nucleic acid sequence, 
gene and/or molecule is associated in its natural 

(e.g. in vivo ) state. 

The term "synthetic" when used herein to 
describe a protein or peptide means a protein or 
peptide produced by a technique (e.g. chemical or 
enzymatic synthesis or recombinant DNA expression) 
other than its natural production in an animal. 
Thus, as produced, "synthetic proteins or peptides" 
are typically essentially pure (e.g. free from 
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proteins or peptides of natural origin). Similarly, 
with reference to DNA sequences or molecules, the 
term "synthetic" means that such DNA sequences or 
molecules have been made by any technique other than 
their natural replication in vivo. Thus , utilizing 
the nucleotide sequences described herein, any DNA 
sequence of this invention can be prepared by various 
"synthetic" techniques known to those skilled in the 
art such as, but not limited to, automated DNA 
synthesizing equipment, other chemical synthesis 
procedures, enzymatic isolation, cDNA synthesis 
or cloning techniques. 

The term "carboxyl -terminal extension 
peptide" or, alternatively, "extension peptide" as 
15 used herein means a peptide encoded in the DNA 
sequences down stream (3-) to the DNA sequence 
encoding mature IGF-I . For example, one extension 
peptide of the present invention has the following 
amino acid sequence: 

20 NH2 - Ar 8- Ser - Va l-Ar 8 -Ala-Gln-Arg-His-Thr-Asp-Met 

Pro-Iys-Thr-Gln-Lys- jfyr-Gln-Pro-Pro-Ser-Thr-A^n 
Ly S -Asn-Thr-r.^-S^^ -Gl n - Arg . Arg _ L ^l Glv _ Tr? _ prn 

Lys-Thr-His^Pro-Glv-n iy-Glu-Gln-Lys-Glu-Gly -Thr 
Glu-Ala-Ser-Leu-Gln-T l e-Ar g -Glv-Ly S -L vs -L vs - Glll 

Gln-Arg-Arg-Glu-Ile-Glv- Ser-Are-Asn-Ala-Glu-Cvs 
Arg-Gly-Lys-L ys-Gly-Lys-COOH 



25 



30 



A second extension peptide consists of the 
underlined amino acids in the above sequence. This 
second extension peptide is encoded in the novel 
IGF- I exon 4 of the present invention. These 
extension peptides are believed to possess and/or 
confer, to proteins encoded in the IGF- I gene, an 
enhanced biological activity. 
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It is also understood by those of skill in 
the art, that references to the proteins of this 
invention by their respective amino acid sequences 
embraces various amino acid substitutions, additions 

5 and/or deletions so long as the activity of the 

protein is substantially maintained. Similarly, the 
essentially pure and synthetic DNA sequences and 
genes identified by the present invention are 
. understood to embrace various nucleotide sequence 

10 variations such as nucleotide deletions, substitu- 
tions, inversions, additions, allelic variations 
and/or functional equivalents for expression of 
protein(s) encoded therein so long as the activity of 
the protein(s) encoded therein is substantially 

15 maintained. Such amino acid and nucleotide sequence 

variations are well known to those of skill in the art 
and can be created by such conventional techniques as 
de novo chemical synthesis, enzymatic manipulations 
and recombinant DNA techniques. It is anticipated, 

20 for example, that alterations of the specific DNA 

sequences and/or genes of the present invention can be 
made in accordance with known genetic code degenera- 
cies to meet the codon and/or expression preferences 
of specific host cells or organisms employed to 

25 synthesize protein(s) of the present invention. 

DNA of the Invention 

In one embodiment of the present invention, 
a novel IGF- I specific cDNA, designated herein as 
IGF-1B cDNA, was identified and isolated. The 
30 nucleotide sequence of this IGF-1B cDNA was 

determined to be as shown in FIG. 2. Further DNA 
sequence analysis of the IGF-1B cDNA revealed a 585 
nucleotide open reading frame (nucleotides 183 to 
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767, FIG. 2) which encodes a novel 195 amino acid 
preproinsulin-like growth factor- I protein designated 
herein as ppIGF-lB. The amino acid sequence of the 
novel ppIGF-lB is as shown in FIG. 6. As described 
more fully below and shown in FIG. 5, a comparison 
between the IGF-1B cDNA and the ppIGF-I cDNA 
published by Jansen et al . (1983) revealed the IGF-1B 
cDNA to be a novel and distinct synthetic DNA 
sequence encoding a novel and distinct ppIGF-I 
protein (e.g. ppIGF-lB). Specifically, while the 
IGF-1B cDNA sequence was determined to be essentially 
homologous to the sequence of Jansen et al . (1983) 
for the first 402 nucleotides, the nucleotide 
sequence then sharply diverges. The Jansen et al - 
(1983) and IGF-1B cDNAs thus give rise (e.g. encode) 
two different and distinct ppIGF-I proteins differing 
in both size (195 amino acids for IGF-1B versus 153 
amino acids) and carboxyl-terminal extension peptide 
composition, the significance of which is discussed 
more fully below. 

The novel IGF- IB cDNA was demonstrated to 
be useful in producing ppIGF-lB in vitro and is 
anticipated to provide a synthetic DNA sequence 
useful for production of ppIGF-lB, or any portion 
thereof, in suitable host cells (i.e. bacterial, 
yeast, mammalian or plant) by conventional 
recombinant DNA techniques and genetic engineering. 
It is also anticipated that in selected mammalian 
cells, the synthetic IGF-1B DNA sequence may be 
useful in producing mature IGF- I as a result of 
cellular or post-production processing of the 
ppIGF-lB to mature IGF-I. 

Having now provided the DNA sequence of the 
IGF-1B cDNA (see FIG. 2), synthetic DNA molecules 
comprising said sequence can now be created by such 
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conventional means as automated DNA synthesis . 

In one embodiment of the present invention/ 
the IGF-1B cDNA was isolated f rom * a cDNA library- 
constructed from human liver messenger RNA (mRNA) . 
5 The cDNA library was constructed and screened for 

clones containing IGF- I cDNA in accordance with the 
methods described by Kwok et al . (1985) and Gubler and 
Hoffman (1983). Although the liver is believed to be 
the major site of IGF production, many other tissues, 

10 for example fetal tissues and fibroblasts are believed 
to synthesize IGF 1 s as well. Thus, such other tissues 
may alternatively be employed to construct a human 
cDNA library. 

• In one preferred embodiment of the present 

15 invention, an oligonucleotide probe corresponding to 
a mature IGF- I coding sequence was employed to screen 
the human cDNA library for the presence analysis of 
clones containing IGF-I specific cDNA sequences. 
Specifically, the probe comprised a 42 nucleotide 

20 molecule (42 mer) corresponding to the DNA sequence 
encoding amino acids 10 through 23 (see amino acids 
58-70 in FIG. 6) of mature, human IGF-I. Such a probe 
can be used to isolate genomic, cDNA and/or RNA 
sequences encoding all or a portion of mature IGF-I. 

25 Any other region of the mature IGF-I DNA coding 

sequence may alternatively be employed. Additionally, 
the length (e.g. number of nucleotides) of such an 
oligonucleotide probe may vary depending upon the 
stringency of the hybridization conditions employed. 

30 Typically, such an oligonucleotide probe should 
consist of at least 13 or 14 nucleotides sharing 
homology with the DNA or RNA sequence(s) sought to 
be identified. 
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The screening of a cDNA library with a pr-VDs 
directed to mature IGF-I allows for the identification 
of potentially all IGF-I messenger RNAs (mRNAs) actively 
expressed in the tissue from which the cDNA library 
was constructed. The translation of such mRNAs can 
include, but are not limited to mature IGF-I, IGF-I 
precursors which contain at least a portion of the 
mature IGF-I protein, and any other proteins containing 
at least a portion of the mature IGF-I protein at the 
mRNA level of said proteins' biosynthesis. Although, 
the preferred embodiment employed a cDNA analysis of 
IGF-I gene products, one may alternatively screen 
cellular mRNA by Northern Blot analysis in accordance 
with methodologies known to those skilled in the art 
to identify IGF-I - specific mRNAs. See Thomas, 
P.S. (1980) . 

As described more fully below, clones 
containing IGF-I specific DNA sequences were then 
isolated and cleaved with various restriction endo- 
nucleases to determine both the size and regions 
of the IGF-I specific cDNA inserts contained therein. 
Any restriction enzymes may be employed. see Maniatis 
et al. (1982). By these means, IGF-I cDNA inserts 
ranging from about 800 to about 1150 nucleotide pairs 
in size were identified and isolated. The subsequent 
restriction mapping of these DNA inserts, unexpectedly 
revealed that the isolated inserts were of two types, 
designated herein as IGF-1A cDNA (IGF-1A) and IGF-1B 
cDNA (IGF-1B), respectively. This discovery of a novel 
ppIGF-I protein coding sequence is significant as it 
now provides a means for identifying and isolating 
novel IGF-I gene sequences, production of a synthetic, 
essentially pure, novel ppIGF-I protein and means by 
which mature IGF-I biosynthesis can be studied and 
possibly manipulated to achieve desirable biological 
activities and/or effects . 
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DNA sequence analysis of these two cDNAs 
was then performed and the respective sequences 
compared as shown in FIG. 5. As previously 
discussed, the IGF-1B cDNA is a novel and distinct, 
synthetic IGF- I DNA sequence which encodes a ppIGF-I 
protein, referred to as ppIGF-lB. The IGF-1A cDNA was 
determined to be equivalent to the cDNA sequence 
isolated by Jansen et al. (1983). Significantly, both 
the IGF-I cDNA's (-1A and -IB) isolated by the methods 
of the present invention contained additional 5 • -end 
sequences which now enable an accurate determination 
of all possible transnational starts of the precursor 
protein(s) (ppIGF-I) encoded therein. 

Specifically, as shown in FIG. 2, four 
possible translation start-signal codons (ATG's), 
in- frame with the DNA sequence encoding mature IGF-I, 
were identified.. The first ATG codon begins at 
nucleotide 84, the second at 183, the third at 252 and 
the fourth at 261. As shown in FIG. 2, the novel 5' 
IGF-1B cDNA sequences of the present invention also 
indicate the presence of two in-frame translation 
termination codons (denoted ### and ***) upstream from 
(5' to) and in- frame with the second ATG codon 
beginning at nucleotide 183 thereby suggesting that 
nucleotides 183 to 185 represent the first operable 
translation start-signal codon (ATG) for the P pIGF-I 
protein(s). Indeed, as shown more fully in the 
examples below, nucleotides 183 to 185 do, m fact, 
constitute an operable translation start-signal 
codon (ATG) for the ppIGF-I proteins encoded m both 
the IGF-1A and. -IB synthetic DNA sequences. 
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This finding is significant as both defini- 
tive translation start-signal codons and translation 
termination codons should be identified within a given 
gene or synthetic DNA sequence before the amino acid 
sequence of a given protein or proteins encoded 
therein may be deduced and/or subsequently produced 
therefrom/ The IGF-I gene and synthetic DNA sequences 
provided by the present invention represent the first 
such delineation of the operable translation start- 
signal codons for both the newly discovered IGF- IB 
encoded ppIGF-I protein (ppIGF-lB) and for the IGF-I 
precursor protein proposed by Jansen et al . (1983). 
Furthermore, the identification of the translational 
start for the ppIGF-I protein(s) now enables quanti- 
tative and qualitative production of synthetic 
ppIGF-I protein(s) by such conventional means as 
chemical, enzymatic and/or recombinant DNA methodo- 
logies . 

Indeed, as described in the examples below, 
the. 2 -ppIGF-I proteins of the present invention can be 
effectively produced by causing expression of the 
ppIGF-I cDNAs r identified and isolated herein, in an 
in vitro cell lysate expression system. Alternatively, 
ppIGF-lA and -IB proteins and/or fragments thereof can 
be produced by recombinant DNA (rDNA) techniques. 
Recombinant DNA production methodologies would involve 
the cloning, by conventional techniques, of at least 
the protein coding sequences (e.g. at least nucleotides 
183 to about 767, FIG. 2, for IGF-1B) into cellular 
(bacterial, yeast, mammalian and/or plant) expression 
vectors, available to those of skill in the art, and 
subsequent production (e.g. expression of the ppIGF-I 
DNA coding sequences) of the ppIGF-I protein in the 
selected host cell. The proteins so produced can then 
be isolated by conventional techniques. By these and 
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other conventional means, such as, but not limited to 
chemical synthesis, the DNA sequence information and 
synthetic DNA sequences containing all or part of the 
IGF-1A and IGF-1B cDNA's, can be employed to produce 
essentially pure mature IGF-I, ppIGF-I (e.g. ppIGF-lB) 
and/or an extension peptide. 

Additionally, the DNA sequences and genes 
of the present invention enable those skilled in the 
art to more effectively study and/or manipulate mature 
IGF-I biosynthesis and biological regulation. 
Furthermore, all or portions of the IGF-I gene or 
DNA sequences of the present invention can be employed 
to identify equivalent DNA molecules, RNA molecules, 
gene(s) and/or allelic variations thereof in human 
and other species (i.e. bovine, porcine and/or avian) 
wherein sufficient homology in such DNA and/or 
RNA sequences exists. Indeed, an example of such RNA 
molecules embraced by the present invention include 
the 1-7, 3.7 and 6.3 kb (kilobase) polyadenylated 
IGF-I RNAs identified by hybridization with the 
IGF-1B cDNA and/or a synthetic DNA encoding the novel 
extension peptide. The degree of homology necessary 
to identify such DNA and/or RNA sequences or 
molecules is well known to those skilled in the art 
and may, for example, vary depending upon the strin- 
gency of the hybridization conditions employed and/or 
relative sizes of the genes or nucleic acid sequences 

and probe employed. 

With regard to allelic variations of the 
gene and DNA sequences provided herein, the (IGF-1A) 
cDNA of Jansen et al. (1983) was determined to main- 
tain an identity (e.g. precise homology) with the 
novel IGF-1B for 413 nucleotides, except for one 
nucleotide difference in the Jansen et al . (1983) 
cDNA sequence, that difference being a conservative 
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third position change in a glycine codon (nucleotide 
452 of the IGF-1B sequence). Such position changes 
in a given codon may result from and/or exemplify 
allelic variation. Such allelic variations as they 
may occur in the IGF-1B DNA sequences of the present 
invention, are considered to represent equivalents 
of the IGF-I gene and DNA sequences provided herein. 

As shown in FIG. 5, after identity for 459 
nucleotides, the IGF-1A and IGF-1B DNA f s then diverge. 
Specifically, when analyzed in terms of protein 
sequence, shown in FIG. 2, the point of divergence 
follows a lysine residue 16 amino acids after the 
mature IGF-I region. As described more fully below, 
the point of divergence corresponds to an exon-intron 
junction present in the IGF-I gene structure shown in 
FIG. 3- The IGF-1A cDNA sequence encodes an addi- 
tional 19 amino acids past the point of the IGF-1A and 
-IB divergence and thereby encodes a ppIGF-I protein 
which may be as long as 153 amino acids in length, 
hereinafter referred to as ppIGF-lA. The IGF-1B 
cDNA sequence contains an additional 61 amino acids 
past the point of the IGF-1A and -IB divergence and 
thereby encodes a ppIGF-I protein which may be as 
large as about 195 amino acids in length, referred 
to as ppIGF-lB. The two carboxyl-terminal extensions 
consisting essentially of the 19 amino acids of 
IGF-1A and 61 amino acids of IGF-1B, respectively, 
show no amino acid homology with each other. Addi- 
tionally, neither carboxyl-terminal extension peptide 
showed homology with any other protein in the National 
Biomedical Research Foundation Protein Sequence Data 
Bank. Dayhoff et al . (1983) and Wilbur and Lipman 
(1983). These differences coupled to the discovered 
existence of two distinct ppIGF-I cDNAs verified by 
the subsequent identification in vivo of corresponding 
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mRNA molecules (see examples, below) confirms that the 
synthetic ppIGF-lB DNA sequence provided in the 
present invention represents a novel ppIGF-I cDNA 
encoding a novel and distinct ppIGF-I protein 
(ppIGF-lB) and extension peptide(s). 

In another important embodiment of the 
present invention, a novel human IGF-I gene was 
identified and isolated. Unlike the gene identified 
by Bell et al- (1985), the IGF-I gene of this inven- 
tion contains 5 exons and encodes at least two 
ppIGF-I proteins. The gene structure and DNA sequence 
of the novel IGF-I gene is shown in FIGS. 3 and 4, 
respectively. While exons 1, 2, 3 and 5 as shown in 
FIG. 3 were structurally identified by Bell et al . 
(1985), the complete DNA sequences of these exons were 
not provided. These sequences are essential for 
determining the number and amino acid compos ition( s ) 
of possible ppIGF-I protein(s) encoded in the IGF-I 
gene. Furthermore, the sequence information is 
essential for subsequent employment of the gene or 
portions thereof (e.g. specific exons or intron-exon 
combinations ) in in vitro or in vivo protein produc- 

tion systems . 

For purposes of synthesizing ppIGF-I pro- 
teins, IGF-I gene related proteins and mature IGF-I 
proteins in, for example, mammalian systems, it may be 
desirable to employ an essentially pure IGF-I gene, 
or portions (e.g. exons) thereof. Specifically, it 
may be desirable to employ IGF-I specific DNA sequence 
comprising intron and exon regions. Alternatively, 
employment of essentially intron- free DNA sequences 
derived from the IGF-I gene provided herein may be 
desirably employed to product ppIGF-I proteins or 
extension peptides . For these reasons and for purpose 
of positioning the novel DNA sequences of the present 
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invention within the IGF- I gene sequence, we isolated 
and cloned IGF-I gene sequences. 

In one preferred embodiment of the present 
invention, two genomic libraries were screened for 
IGF-I gene inserts employing IGF-1A and -IB cDNA's or 
fragments thereof. As described more fully below and 
shown in FIGS. 3 and 4, we identified five exons 
whereas only 4 exons had previously been reported. 
See Bell et al . (1985). The newly discovered exon 4 
(see FIGS. 3 and 4) was determined to encode a 
carboxyl-terminal extension of ppIGF-lB whereas exon 
5 encodes a carboxyl-terminal extension of ppIGF-lA. 

Exon 1 encodes at least 21 amino acids 
which may comprise a leader peptide enabling secretion 
of IGF-I. IGF-I is known to be secreted by the cells 
producing it hence the term "preproinsulin-like growth 
factor-I w (ppIGF-I) to denote an IGF-I precursor 
protein. Exon 2 is about 157 nucleotides in length 
and encodes 52 amino acids and is identical in DNA 
sequence to an IGF-I exon reported by Ullrich et al . 
(1984). The initial 27 amino acid residues encoded in 
exon 2 precede the start of the mature IGF-I B domain 
and contain two methionine residues which may serve as 
additional translation start signals. Exon 2. also 
encodes the beginning 25 amino acids of the B domain 
of mature IGF-I. Codon 26 was found to be interrupted 
by a large intron of at least 21 kb . The remainder of 
the genetic information coding for the mature IGF-I 
protein resides in exon 3. Specifically, exon 3 
contains 11 nucleotides encoding the rest of the B 
domain, 12 codons encoding the C domain, 21 codons 
encoding the A domain and 8 codons encoding the D 
domain. In addition, exon 3 codes for 16 additional 
amino acids which may represent a newly described E 
domain and/or a portion of the carboxyl-terminal 
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extension of plGF-lA and pIGF-lB. Thus, the 
"carboxyl- terminal extension proteins" or "extension 
peptides- of the present invention can comprise a 
combination of the last 16 amino acids encoded m 
exon 3 (e.g. NH 2 -Arg Ser Val Arg Ala Gin Arg His 
Thr Asp Met Pro Lys Thr Gin Lys-COOH) and the 
peptide encoded in exon 4, the peptide encoded m 
exon 4 and/or any series of amino acids encoded in the 
DNA sequences found within exon 4 and the last 48 

nucleotides of exon 3. 

Exons 4 and 5 were found to each encode 
distinct extension peptides, translation termina- 
tion codons, 3' nontranslated regions, poly A 
addition (e.g. adenylation) signals and sites Exon 
4 located 1 . 5 kb 3 ' to exon 3, consists of 515 
nucleotides and comprises the 3 « -end (e.g. carboxyl- 
terminus) of the IGF-1B coding sequence. The 
extension peptide encoded in exon 4 comprises a highly 
basic 61 amino acid peptide with several dibasic and 
tribasic residues. Based upon analogies with other 
biosynthetic pathways employing precursor protein 
intermediates (e.g. conversion of proinsulin to 
insulin), these dibasic and tribasic residues may 
provide for the enzymatic processing of the ppIGF-I 
protein(s) encoded in the gene or genes of the present 
invention into, for example, a mature IGF-I protein. 
Additionally or alternatively these residues may 
facilitate binding to such negatively charged macro- 
molecules as DNA. Following the TGA termination 
codon in exon 4, are 329 bases of 3' nontranslated 
DNA. Exon 5, located 17 kb downstream (3') from 
exon 3 is 344 bases long, encodes the 3 • -end of the 
IGF -1A coding sequence and contains a TAG translation 
termination codon and 284 bases of 3- nontranslated 
DNA Additionally, all intron-exon borders contain 
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consensus RNA splicing sequences indicating that any 
combination of. exons may exist within a mRNA molecule 
and, hence, be translated into an IGF-I gene related 
protein. 

The genes and DNA sequences provided herein 
now allow the production, in vitro , by such techniques 
as recombinant DNA and chemical synthesis, and the 
detection, in vivo , of proteins comprising a single 
exon encoded protein and/or any combination of exon 
encoded proteins. Such proteins are believed to be 
biologically active and, specifically, to possess 
IGF- I -like activity- As such, the attendant biologi- 
cal activities maybe ascertained by those skilled in 
the art employing in vitro and/or in vivo assays . see 
Daughaday (1977); Clemmons and Van Wyk (1981a); van 
Buul-Offers and Van de Brande (1980); Schoenle et al . 
(1982); and Hylka et al . (1985). 

Furthermore, the isolation, restriction map 
and determination of the IGF-I gene structure and 
sequence is significant as it now provides an oppor- 
tunity to localize DNA polymorphisms detected by, for 
example, chromosomal restriction endonuclease analy- 
sis. Indeed, we have localized the site polymorphisms 
for restriction endonucleases Hind III and Pvu II 
reported by Bell et al . (1985) to be linked to one 
another and to map to exon 5 of the genomic IGF-I 
gene. Such polymorphisms are useful in establishing 
familial genetic linkages and have been employed 
diagnostically for various diseases. Additionally, 
such IGF-I gene polymorphisms may be employed 
diagnostically for growth-affected conditions. 
Furthermore, such polymorphisms are considered to 
constitute equivalents of the gene sequences provided 
herein. 
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The present invention further contemplates 
that the essentially pure IGF-I gene of the present 
invention or portions (e.g. exons) thereof may now 
be employed to generate, de novo, and/or amplify, 
in vivo, IGF— I gene related proteins including but 
not limited to ppIGF-I proteins, mature IGF-I 
proteins, variants thereof and/or biosynthetic 
intermediates thereof. 



The Proteins 



in one embodiment, the present invention is 
directed to a novel ppIGF-I protein (ppIOF-lB) having 
an amino acid sequence as shown in FIG. 6. In 
another important embodiment, the present invention 
is directed to novel extension peptides exemplified 
by a peptide having the amino acid sequence as shown 
in FIG. 7. Based upon the sequences provided herein, 
these proteins and equivalents thereof can now be 
made by such conventional means as chemical synthesis, 
in vitro enzymatic synthesis, rDNA techniques and/or 
isolation from human tissue. 

It is anticipated that production of the 
proteins or peptides of this invention by rDNA and/or 
chemical synthesis may result in minor alterations in 
amino acid composition. For example, production m 
bacteria may result in addition of a methionine at the 
amino (NH 2 )- terminus and chemical synthesis may result 
in variations of the carboxy (-COOH) terminus such 
that any of the radicals -COOR, -CRO, -CONHNR, , 
-ConRxRa or -CH 2 OR (R x and R 2 being independently 
lower alkyl or hydrogen) may be found. such essen- 
tially pure and/or synthetic proteins are considered 
within the scope those of the present invention. 



~ 25 - 07-24(401)A 

Furthermore, the novel DNA sequences and 
genes provided by the present invention now enable 
those skilled in the art to identify, isolate and/or 
produce ppIGF-I proteins and/or biologically active 
fragments thereof including, but not limited to, 
mature IGF-I and extension peptides. The biological 
activity of these proteins, may include, but is not 
limited to, the growth promoting activit(ies) of 
mature IGF-I and can, therefore, be ascertained in 
accordance with the in vitro and in vivo assays known 
to those skilled in the art. 

Specifically, IGF-I has been shown to 
stimulate cell proliferation, skeletal growth, weight 
gain and mammary gland growth, the latter being a 
prerequisite for enhanced milk production. It is 
anticipated that the novel ppIGF-lB protein described 
herein shares all or some of such growth promoting 
activities and can exhibit enhanced activity in 
selected tissue. 

It is also anticipated that biologically 
active fragments of the proteins of this invention can 
be made by conventional means. These fragments are 
hereinafter sometimes referred to as " IGF-I gene 
related proteins" and are understood to comprise 
proteins wherein at least a portion of said protein(s) 
is encoded in the gene and/or synthetic DNA sequences 
of the present invention and/or allelic variations 
thereof. 

In one approach, the amino acid sequences 
of the ppIGF-I (e.g. ppIGF-lA and ppIGF-lB) proteins 
and extension peptides thereof were deduced from the 
DNA sequences of their respective cDNAs and from the 
novel IGF-I gene provided by the present invention. 
For example, the ppIGF-lB protein was determined to 
have the amino acid sequence as given in FIG. 6. 
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in another approach, the ppIGF-I proteins 
were identified by employing in vitro systems which 
provided for expression of the IGF-1A and -IB cDNAs 
The term "expression" as used herein when referring to 
a gene or synthetic DNA sequence means transcription 
of a given gene or DNA sequence into mRNA followed 
by translation of the mRNA into protein. The proteins 
so produced can thereafter be isolated by such conven- 
tional techniques as polyacrylamide gel electrophoresis, 
affinity chromatography, ion-exchange, reverse 
phase HPLC, immuno-precipitation or any other conven- 
tional fractionation technique and then sequenced. 
By these methods, it was demonstrated that a 195 ammo 
acid-ppIGF-lB protein and a 153 amino acid ppIGF-lA 
protein can be produced in vitro, that these proteins 
corresponded to their respective amino acid sequences 
deduced from their respective cDNA sequences identi- 
fied herein and that the first in- frame methionine 
codon [e.g. base pairs 183-185 for IGF-1B (see FIG^>] 
is the major translation start-signal codon for both 

ppIGF-I proteins. 

in order to confirm that the proteins (e.g. 
ppIGF-lA and ppIGF-lB) encoded in the synthetic 
IGF-1A and-lB DNA sequences and/or the carboxyl- 
terminal extensions thereof are produced as distinct 
and separate peptides and/or as part of separate and 
distinct proteins in vivo, we did the ^llowxng. 
Radiolabeled complete IGF-1A cDNA or complete IGF-1B 

cDNA and radiolabeled DNA's corresponding to the 

coding sequences contained in either exon 4 or ex. ; 

were individually hybridized to RNA transcripts (mRNA) 

produced in human liver. 
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By these means, we identified polyadeny- 
lated RNA molecules ranging from about 900 to about 
1350 nucleotides in length which hybridized to all of 
the aforementioned probes . These results confirm the 
5 active transcription of DNA encoding both the IGF-1A 
and -IB DNA sequences in vivo > Furthermore, we 
identified three additional polyadenylated RNAs 
(mRNAs) approximately 1.7, 3.7 and 6.3 kilobases (kb), 
respectively, in length. While applicants do not wish 
10 to be bound by the following theory of mechanism, we 
believe that these larger mRNAs represent forms of 
■ppIGF-I precursor mRNAs or mRNAs encoding IGF- I gene 
related proteins (e.g. proteins consisting essentially 
of the sequence of essentially pure ppIGF-I and/or an 
15 essentially pure extension peptide). Additionally, 

our discovery that the human IGF- 1 gene of the present 
invention elaborates multiple proteins represents a 
means by which IGF- I biosynthesis and/or maturation 
may be regulated and/or a means by which IGF-I gene 
20 related proteins may be regulated and/or synthesized. 
Specifically, as IGF-I is required primarily to 
support linear growth, the levels of gene expression 
may vary during growth and development and/or may vary 
in different tissues depending upon the growth require- 
25 ments of said tissue(s). The ppIGF-I proteins may 
provide a point of regulation for mature IGF-I pro- 
duction and/or IGF-I gene related protein production 
during these growth stages and/or in a tissue specific 
manner . 

3 0 The identification and isolation of the 

ppIGF-I cDNAs and corresponding mRNA synthesized in 
vivo suggest that the ppIGF-I proteins of the present 
invention are made in vivo . Although yields are 
expected to be low, relative to commercially desirable 

3 5 yields, it is anticipated that based upon the DNA and 
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amino acid sequences provided herein, essentially pure 
ppIGF-I proteins may be isolated from tissue (i.e. 
fibroblasts, blood, liver, etc.) and/or cells therefrom. 
Such essentially pure proteins are considered to be 
part of the present invention. 

The recent identification of a rat IGF- I I 
protein containing an E domain and isolation of an 
active free peptide representing the E domain xn rat 
serum, Zumstein et al. (1985) and Hylka et al. (1985), 
suggests that a family of IGF gene related biologx- 
cally active peptides or proteins may be made xn 
vivo. such proteins would include but are not Ixmxted 
to the novel IGF-1B carboxyl-terminal extension 
peptide disclosed herein. It is also anticipated 
that proteins consisting of various combinatxons of 
IGF-I gene exons may be included in such a famxly of 
IGF gene related proteins. For example, the carboxyl- 
terminal extension peptides of pIGF-lA and/or IB may 
serve specific tissue-limited, biological functions. 
The functions of the alternatively expressed carboxyl- 
terminal peptides may be examined by those skilled xn 
the art by employing, for example, synthetic peptides 
and antibodies thereto. Alternatively, RNA splicing 
may provide a point of regulation for IGF-I gene 
expression as has been suggested for other genes, 
Rosenfeld et al . (1983), in which different combina- 
tions of exons function in different tissues. Such 
possible exon combinations are considered to be 
embraced by the present invention. 

The present invention also contemplates 
that the genes and DNA sequences provided herein may 
be employed to generate and/or aid in the isolation 
of said IGF-I gene related proteins. While the present 
invention describes a specific p P IGF-lB amino acid 
sequence encoded in such genes and DNA sequences, it 
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is anticipated that alternative open reading frames 
and thus alternative protein (e.g. amino acid sequences) 
may be encoded in the gene and DNA sequences provided 
herein. Such proteins are herein considered to be 
among the IGF- I gene related proteins provided for 
by the novel genes and DNA sequences provided herein. 

As previously discussed, it is believed 
that the novel ppIGF-I and extension peptides of the 
present invention are useful for administration to 
animals for promoting growth, increasing milk produc- 
tion and/or the lean-to-fat ratio or muscle content in 
animals. For purposes of such uses, one or more 
proteins or peptides of this invention (or non- toxic 
salts thereof) can be combined with a non- toxic, 
physiologically acceptable carrier (liquid or solid) 
to form a composition which can be administered to 
animals by any suitable technique, e.g. intravenously, 
subcutaneously, intramuscularly, intranasally , or 
orally in a form that protects the proteins or peptides 
from degradation in the digestive tract. Such composi- 
tions can be administered to the animal by injection, 
infusion or implantation, preferably in a medium (e.g. 
dispersion in oil or a polymer) which facilitates 
delivery of the peptide to target cells of the animal 
at a desired rate. One such method for administration 
to an animal (e.g. pregnant cows and heifers) is 
intramammary infusion of the protein or peptide in a 
vehicle suitable for achieving adequate dispersion in 
mammary tissue as taught in more detail in copending 
U.S. Patent Application, Serial No. 837,477, filed 
March 7, 1986. The disclosure of said application 
is incorporated herein by reference. The proportions 
of carrier and biologically active peptide in such 
compositions can be any that facilitate the desired 
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effects in animals. Preferred proportions can be 
readily determined by those skilled in the art. 

The required dosage will vary with the 
particular result sought and duration of desired 
treatment. An effective dose of the ppIGF-lB protein 
and/or extension peptide(s) is from about 60 
micrograms (ug) to about 6 milligrams (mg) per 
kilogram (Kg) of animal body weight administered 
daily for a treatment period of from about one to 
about three weeks. A more preferred dose is from 
about 400 \ig to ab° ut 800 Per Kg per day. As used 
herein, a therapeutically effective concentration is 
defined as a concentration of the particular protein 
or peptide which provides a satisfactory stimulation 
in the desired cellular growth and/or functionality. 
The most preferred amount or dosage most effective for 
achieving a desired result (e.g. increased milk pro- 
duction) can be determined by routine experimentation. 
The preferred dosage may depend on such variables as 
the size, general health and nutritional status of the 

specific animal . 

Bioactive peptides of this invention can be 
used in an essentially pure form, i.e., free. from 
other proteins or peptides (of whatever origin) having 
a significant effect on the bioactivity of the proteins 
or peptide(s) of this invention. This is not essential, 
however, as in many utilities proteins or peptides of 
this invention can be used satisfactorily (in many 
cases, even advantageously) in mixtures or other 
combinations with different proteins or peptides, e.g. 
other animal growth factors such as bovine (or other 
animal) IGF- I, EGF or TGF-a ( alpha- trans forming growth 
factor). 



-31- 07 



-24(401 )t)229750 



10 



15 



The following examples illustrate preferred 
embodiments of the present invention and are not 
intended to limit the invention's scope in any way. 
While this invention has been described in terms of 
its preferred embodiments, various modifications and 
variants thereof shall be apparent to one skilled in 
the art from reading this application. 

All enzymes including restriction 
enconucleases, T 4 DNA ligase, T 4 polynucleotide 
kinase, DNA polymerase, ribonuclease A, DNA modifying 
enzymes, wheat germ lysates, rabbit reticulocyte 
lysates, unlabelled amino acids and proteinase K were 
purchased from New England Biolabs (Beverly, Massachu- 
setts) and Bethesda Research Laboratories (BRL) 
(Gaithersburg, Maryland) and used in accordance with 
manufacturer's specifications. Nitrocellulose filters 
were obtained from Scheicher and Schuell (Keene, New 
Hampshire) and Millipore (Bedford, Massachusetts). 
Radionuclides and radiolabeled amino acids were 
20 purchased from New England Nuclear (Boston, Massachu- 
setts) and Amersham (Arlington Heights, Illinois). 
Deoxyribonucleoside triphosphate, ribonucleoside 
triphosphates and dideoxyribonucleoside triphosphates 
were purchased from Pharmacia-PL Biochemicals 
25 (Piscataway, New Jersey). 

A 42 base oligonucleotide corresponding to 
the DNA sequence encoding amino acids 10 to 23 of 
mature human IGF-I (Jansen et al . 1983 and Ullrich et 
al. 1984) was synthesized in the Department of 
30 Biological Sciences, Monsanto Company (St. Louis, 
Missouri) employing an Applied Biosystems DNA 
Synthesizer in accordance with the procedure set 
forth by the manufacturer. Applied Biosystems, Inc. 
(Foster City, California). The sequence of the 42 
35 mer was as follows: 
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5 ' -AAAGCCCCTGTCTCCACACACGAACTG- 
AAGAGCATCCACCAG-3 ' 

The 42 mer was labeled at the 5- end by using gamma 
3 2 P adenosine triphosphate and T 4 polynucleotide 
kinase in accordance with the procedure set forth by 
Maniatis et al . (1982) to achieve a specific activity 
of about 10' dmp per picomole. The radiolabeled 42 mer 
was then employed to screen for IOF-I containing DNA 
sequences in a human liver cDNA library made in 
accordance with the procedures set forth by Kwok 
S C M. et al. (1985) and Gubler and Hoffman (1983). 
The human liver cDNA library in lambda gtll was 
obtained from Drs . S.L.C. Woo and T. Chandra (Baylor 
University School of Medicine, Houston, Texas). 
Lambda gtll may also be obtained from the American 
Type Culture Collection (ATCC) (Rockville, Maryland) 
under ATCC accession number 37194 and a human cDNA 
liver library prepared therein in accordance with the 
procedure described by Kwok, S.C.M. et al . (1985). 
E coli K12 strain Y1088 may be obtained from ATCC 
"leTlTCC accession number 37195. The plasmids pGEMl 
and PGEM2 were obtained from Promega Biotech (Madison, 
Wisconsin) - 

Example 1 

The following example demonstrates the 
isolation of IGF-1A and IGF-1B cDNA-s from a human 
liver cDNA library- 
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Briefly, the human liver cDNA library in 
lambda gtll was screened as follows. Lambda gtll was 
plated on E_;_ coli K12 strain Y1088 in accordance with 
the method described by Young and Davis (1983). 
Duplicate nitrocellulose filters were prepared in 
accordance with the procedure described by Woo, 
S.L.C. (1979) and hybridized at 42 °C for about 20 
hours using 2 x 10 6 dpm of probe per filter in buffer 
containing 5 x SSC (1 x SSC = 150 mM NaCl, 15 mM Na 
citrate, pH 7.0), 50 mM Na phosphate, pH 6.8, 40% 
(v/v) deionized formamide, 50 micrograms/milliliter 
denatured salmon sperm DNA, 5 x Denhardt solution 
[0.1% (w/v) ficoll, 0.1% (w/v) bovine albumin, 0.1% 
(w/v) polyvinyl pyrolidone, Maniatis et al. (1982)]. 
Following hybridization the filters were washed for 
15 minutes at 22°C and for 15 minutes at 40°C in a 
solution containing 0.2 x SSC, 0.1% (w/v) sodium 
dodecyl sulfate (SDS) and were thereafter exposed to 
Kodak XAR5 film (Eastman Kodak Co., Rochester, New 
York) using intensifying screens. Positive plagues 
were rescreened at lower density. Specifically, the 
primary screening employed about 2.5 x 10 4 plagues 
per 137 millimeter filter and the secondary screening 
employed about 1-2 x 10 2 plagues per 87 millimeter 
filter. Phage DNA from the positive plagues was then 
prepared as described by Helms et al - (1985) and 
mapped by restriction enzyme digestion and gel 
electrophoresis, Maniatis et al . (1982), and subse- 
quently sequenced. 

Approximately 5 x 10 5 plagues of the human 
liver cDNA library were screened as just described 
with the iGF-I-specific 42 mer probe leading to the 
isolation of 7 positive plagues with DNA inserts 
ranging from about 800 to about 1150 nucleotide pairs. 
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By restriction mapping with EcoRI , BamHI and PstI, the 
cDNA's were found to be of two types. Two of the 7 
positive plaques containing inserts of approximately 
800 to 850 nucleotide pairs contained internal Bam HI 
restriction endonuclease sites and were found to 
correspond to the IGF-I cDNA reported by Jansen et al . 
(1983). These inserts were designated IGF-1A cDNA. 
The remaining 5 positive clones shared a restriction 
endonuclease map that differed from the map of the 
IGF-1A cDNA. These 5 clones were designated IGF- IB 
cDNA. The largest two inserts, shown as AIGF-2 and 
XIGF-5- in FIG- 1 were then selected for DNA sequence 
analysis . 

DNA sequencing was performed by the dideoxy 
chain terminating method described by Sanger et al, 
(1977) and Biggin et al . (1983) after subcloning 
restriction fragments into M13mpl8 and M13mpl9 
bacteriophage as described by Norrander et al. (1983). 
The sequencing strategy is shown in FIG. 1. The DNA 
sequences of the two IGF-1B cDNA isolates were 
determined in their entirety on both strands and 
across all restriction enzyme sites used as initiation 
points except for the extreme 3 1 -end of one clone 
MGF-5 (see FIG. 3), which was sequenced three times 
in only one orientation. 

Both isolates gave identical results over 
shared regions . The DNA sequence and amino acid 
translation appear in FIG. 2. The aggregate IGF-1B 
cDNA consists of 113 6 nucleotides including 42 
deoxy adenosine residues of the poly A tract. The 
size of the IGF-1B cDNA agrees with the size of the 
major mRNA (900 to 13 50 nucleotides) determined by 
filter hybridization, described more fully below. 
The sequence can be divided into three sections: a 
5 ! untranslated region comprising the initial 182 
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nucleotides; an initiation codon- and an open reading 
frame of about 585 nucleotides (195 codons) followed 
by an opal ( TCA) ' termination codon; and a 3' 
untranslated region of about 368 nucleotides ending 
in a poly A tract. 

As shown in FIG. 2, the 585 nucleotide open 
reading frame begins with the second in phase ATG 
codon. The first ATG at nucleotides 84 to 86 is 
followed immediately by an in frame opal terminator. 
The open reading frame shown in FIG. 2 encodes a 
ppIGF-I of about 195 amino acids with a molecular 
weight of about 21,841 daltons assuming that the ATG 
codon at bases 183 to 185 initiates protein synthesis. 
The mature IGF- I protein sequence is encoded by 
nucleotides 327 to 536 denoted by the crosshatched 
area in FIG. 1 and underlined in FIG. 2. The 70 
mature IGF-I codons are followed by a unique and novel 
carboxyl-terminal extension of 77 amino acids. 
Although the first 16 amino acids of this carboxyl- 
terminal extension are identical to the first 16 amino 
acids of the IGF-1A carboxyl-terminal extension, the 
remaining 61 amino acids are unique as determined by 
lack of homology with the IGF-1A carboxyl-terminal 
extension or with any other protein in the National 
Biomedical Research Foundation Protein Sequence Data 
Bank. Dayhoff et al . (1983) and Wilbur and Lipman 
(1983) . 

Confirmation of the in vivo expression of 
two distinct ppIGF-I proteins encoded in the IGF-1A 
and -IB cDNA's, respectively, was achieved by iden- 
tifying messenger RNA molecules encoding these 
proteins. Specifically, liver polyadenylated RNA was 
isolated from tissue fresh-frozen at -70°C by extrac- 
tion with guanidinium thiocyanate in accordance with 
the procedure described by Chirgwin et al. (1979) and 
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one round of chromatography on oligo dT cellulose are 
described by Aviv and Leder (1972). The polyadeny- 
lated RNA was denatured with glydxal as described by 
McMaster and Carmichael (1977), electrophoresed 
5 through a 1.25% (w/v) agarose gel and transferred to 

a nitrocellulose filter by blotting in accordance with 
the procedure described by Thomas (1980). Complete 
IGF-1A and IB cDNA's or DNA's encoding either, the 
IGF-1A or -IB carboxyl-terminal extensions were all 

10 individually labeled with 32 P by nick translation as 

described by Rigby et al . (1977) to 8-12 x 10 8 dpm per 
microgram and hybridized to the filters at 42°C in a 
solution containing 50% (v/v) formamide, 5 x SSC, 
50 mM Na phosphate, pH 6.5, 100 micrograms per 

15 milliliter denatured salmon sperm DNA, 1 x Denhardt 

solution and 10% (w/v) dextran sulfate. Filters were 
washed for 15 minutes at 22 °C in buffer containing 0.2 
x SSC and 0.1% (w/v) SDS, for 30 minutes in two 
changes of the same buffer at 48°C, and then autora- 

20 diographed using intensifying screens for 62 hours at 
-70°C. 

Both IGF-1A and -IB cDNA's hybridized to 
RNA transcripts in human liver. Hybridization of 
either the unique carboxy- terminal extension of IGF-1A 

25 or -IB cDNA shows a major band of approximately 900 to 
about 1350 nucleotides. Other larger bands were seen 
at 1.7, 3.7 and 6.3 kilobases potentially, respec- 
tively, processed precursor mRNA's or alternatively 
other IGF-I gene related mRNA's. Parallel experiments 

30 using entire IGF-1A and -IB cDNA's yielded similar 
results. 

Example 2 



This example demonstrates the isolation and 
mapping of the human IGF-I genomic gene. 
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A human genomic library was prepared using 
a hybrid vector, \MG14, Helms et al. (1985), and 
size- fractionated human leukocyte DNA, partially 
digested with Mbol as described by Maniatis et al - 
(1982). Genomic libraries may be prepared as 
described by Maniatis et al . (1982). Approximately 4 
x 10 5 plaques from this library and another derived 
from human fetal liver, described by Lawn et al. 
(1978), were screened in accordance with the methods 
described by Maniatis et al . (1982) using 32 P-labeled 
IGF- I cDNA's, prepared by nicked translation as 
described by Rigby et al . (1977), as hybridization 
probes. DNA from plague-purified positive isolates, 
Helms et al . (1985), was mapped using Bam HI, Eco RI 
and Hind III single and double digestions by 
hybridization to 32 P IGF-I cDNA's and human Alu 
probes, prepared as described by Schmid and Jelinek 
(1982). From 15 genomic clones containing IGF-I 
exons, 8 were selected for further analysis consisting 
of restriction endonuclease mapping and Southern 
blotting. The eight clones selected and genomic DNA 
regions contained therein are shown in FIG. 3(a) and 
are marked MGF- followed by a number. 

The results of these analyses are summarized 
in FIG. 3(a) which presents a restriction map of the 
isolated genomic IGF-I gene. Five regions of the 
isolated DNA hybridized to the cDNA probes and were 
designated exons 1-5, shown as the numbered boxes in 
FIG. 3(a). Exons 1,2,3 and 5 comprise IGF-1A mRNA 
(e.g. DNA encoding pIGF-lA), and exons 1, 2, 3 and 4 
comprise IGF-1B mRNA (e.g. DNA encoding pIGF-lB) . 
From the beginning of exon 1 to the end of exon 5, the 
gene extends for more than 45 kilobases (kb). The 
incompletely characterized DNA region between exons 2 
and 3 was determined to represent an intron as that 
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sequence interrupts an asparagine codon in the B 
domain of mature IGF-I and, thus, cannot contaxn an 
additional exon. As shown in FIG. 3(a), wxthxn the 
genomic IGF-I gene are five regions which hybrxdxze 
strongly to middle repetitive DNA -of the Alu type, 
Schmid and Jelinek (1982). These Alu type .e^-nces 
maP to intervening and flanking DNA sequences. Schmxd 

and Jelinek (1982). 

Subclones containing each exon were prepared 

in PUC18 and P UC19 plasmids in accordance with the 
procedures described by Norrander et al . (1983) for 
further restriction mapping, and in M13m P 18 and 
M13 mpl9 in accordance with the procedures desc rxbed by 
Norrander et al . (1983), for DNA seguencxng. M 13m P 18, 
M13m P 19, PUC18 and P UC19 may be obtained from New 
England Biolabs (Beverly, Massachusetts). The DNA 
sequence analysis was performed in accordance wxth the 
procedures described by Sanger et al. (1977) and 
Biagin et al . (1983) using dideoxy chain- termxnatxng 
inhibitors, 3*S-dATP and both standard and denaturxng 
ael electrophoresis. The DNA sequencing strategy 
employed is diagramed in FIG. 3(b). Initial DNA 
sequencing of exons 3 and 4 was obtained after 
preparing a series of overlapping deletions us xn g Bal 
31 exonuclease, described by Poncz et al . (1982), 
indicated by the solid circles in the seguencxng 
scheme shown in FIG. 3(b)- All other sequence analy- 
sis was initiated at specific restriction endonuclease 
sites as depicted by the short vertical lines xn FIG. 
3(b) The arrows indicate the extent of sequence 
determined. Except for 100 nucleotides of the xntron 
preceding exon 2 and a portion of exon 4, J**"*™ 
presented in FIG . 4 were verified from both DNA 
strands, including all restriction endonuclease sites 
used as initiation points. All exon-intron splice 
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junctions and polyadenylation sites, were., determined by 
comparison with IGF-1A and IGF-1B cDNA sequences. 

Southern blot analysis was conducted briefly 
as follows. Ten micrograms of DNA from human 
leukocytes, Maniatis et al . (1982), were digested 
with various restriction enzymes Hind III, Pvu II, 
Eco RI, Bam HI and Bgl II and then electrophoresed 
through 0.8% (w/v) agarose gels at 20-25 volts in 
buffer containing 0.089 M Tris-borate, 0.089 M boric 
acid and 0.001M Na 2 EDTA. DNA fragments were trans- 
ferred to nitrocellulose filters by blotting in 
accordance with the procedure described by southern 
(1975). Prehybridization of the filters, hybridiza- 
tion to radiolabeled IGF- I cDNA probes, prepared as 
previously described, and posthybridization washes 
followed the procedure of Wahl et al . (1979). 
Hybridizing bands were detected by autoradiography. 

Southern blot analysis was performed on 18 
unrelated adults of normal stature using IGF-1A and 
IGF-1B cDNA probes and the restriction enzymes listed 
above. We found the site polymorphisms for Hind III 
and Pvu II reported by Bell et al . (1985) to be linked 
to one another and to map exclusively near exon 5 of 
the genomic IGF-I gene. 

The complete genomic IGF-I gene sequence is 
presented In FIG. 4. The region of exon 4 sequenced 
in one orientation agrees completely with the corres- 
ponding cDNA sequence encoding the carboxyl-terminal 
' extension peptide of pIGF-lB. The DNA sequence of 
exon 5 agrees completely with the corresponding cDNA 
sequence encoding the C-terminal extension peptide of 
pIGF-lA. 
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By primer extension analysis using human 
liver RNA and a 32 P-labeled oligonucleotide, 5 1 TGAG 
AGCAATGTCACATTTC3 1 , the 5 1 -end of the ppIGF-I mRNA 
was found to extend 583 nucleotides upstream of the 
primary site at position 1108, FIG. 4. If the 5 f 
untranslated region of the gene does not contain an 
intron, transcription must begin at position 525 or 
526, FIG, 4. If an intron is contained within the 5 f 
untranslated region, transcription would most likely 
begin further upstream (5 1 ) to positions 525 or 526. 
The IGF-1B cDNA shown in FIG. 2 extends 182 nucleo- 
tides 5 1 to the translation initiation codon and 
contains several upstream in-phase translation ter- 
mination codons as denoted in FIG. 4. A region of 
42 nucleotides further 5 f at positions 199 to 241, 
FIG. 4, in the sequence contains alternating purine 
and pyrimidine residues with the potential to form Z 
DNA, indicative of actively transcribed DNA. Nordheim 
and Rich (1983 ) . 

Example 3 

This example demonstrates the expression in 
vitro of IGF-1A and IGF-1B precursor peptides and 
demonstrates the ability of the first in-frame 
methionine (ATG) codon to signal the start of ppIGF-I 
translation. 

In the present example, human IGF-1A and 
IGF-1B cDNAs were separately ligated to the plasmids 
pGEMl and pGEM2 to create recombinant plasmids con- 
taining an IGF- I cDNA with its 5'- end adjacent to the 
bacteriophage T 7 promoter contained within the pGEM 
plasmids. Specifically, the IGF-1A cDNA was ligated to 
a pGEMl plasmid previously digested with EcoRI . The 
correct orientation of the IGF-1A cDNA in the recombinant 
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plasmid was verified by restriction endonuclease 
analysis employing BamHI . Recombinant pGEM2 plasmids 
containing IGF-1B cDNA were constructed by ligating a 
Rsal digested IGF-1B cDNA containing an EcoRI linker 
at its 3'-end to a pGEM2 plasmid previously double 
digested with EcoRI and Hindi. The correct orien- 
tation of the IGF-1B cDNA with respect to the T 7 
promoter in the recombinant plasmid was verified by 
digest with PstI . After isolation of the recombinant 
plasmids, designated pGEMl/IGF-lA and pGEM2/IGF-lB, 
respectively, each plasmid was linearized at the 3 f 
end of the respective IGF-1 DNA sequences using 
enzymes Hindi I I and EcoRI, respectively. Comple- 
mentary RNAs, designated IGF-1A cRNA and IGF-1B cRNA, 
were prepared by transcription in vitro using T 7 RNA 
polymerase (Davanloo et al . , 1984), according to the 
method of Melton et al. (1984). In order to increase 
the efficiency of subsequence translation, each RNA 
was co-transcriptionally 'capped 1 , following the 
procedure of Hart et al. (1985). Capped, full-length 
IGF-1A and IGF- IB cRNAs were translated in vitro using 
both wheat germ and rabbit reticulocyte lysates 
(Pelham and Jackson, 1976), in the presence of 
35 S-methionine or 3 H-leucine and the 19 remaining 
complement of unlabelled amino acids, with equivalent 
results. The primary translation products were 
assessed by polyacrylamide gel electrophoresis 
(Laemmli, 1970), in 12% (w/v) or 15% (w/v) acrylamide, 
followed by autoradiography. The experimentally 
determined primary IGF-1A translation product demon- 
strated a mobility of approximately 17,000 daltons , 
in excellent agreement with the predicted molecular 
weight of the 153 amino acid precursor, 17,026 daltons. 
Additionally, the amino-terminal amino acid sequence 
of this IGF-1A peptide synthesized in vitro was 
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determined and agreed completely with the sequence of 
the primary translation product predicted from the 
IGF-1A cDNA. Translation thus begins with the 
first in- frame methionine codon (e.g. base pairs 
5 59-61 for IGF-1A, FIG. 5) and not at methionine 

residues 24 or 27 (e.g. base pairs 128-130 or 137-139, 
respectively, see FIG. 5). IGF-1A was thus synthesized 
as a 153 amino acid precursor. Similarly, the 
experimentally-determined primary IGF-1B translation 
10 product migrated on a polyacrylamide gel with a 

mobility of approximately 22,000 daltons, also in 
excellent agreement with the predicted molecular 
weight of the 195 amino acid IGF-1B, 21,841 daltons, 

as defined by the IGF-1B cDNA sequence. Thus, 
15 IGF- IB was produced as a 195 amino acid molecule 

corresponding to the protein encoded in the synthetic 

DNA and essentially pure IGF-I gene of the present 

invention . 

Various other examples will be apparent to 
20 the person skilled in the art after reading the 

present disclosure without departing from the spirit 
and scope of the invention, and it is intended that 
all such other examples be included within the scope 
of the appended claims . 
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CLAIMS 

1. A synthetic DNA sequence encoding a 
preproinsulin-like growth factor- I protein comprising 
the sequence of amino acids shown in Figure 2* 
5 2 . A synthetic DNA sequence comprising 

the sequence of nucleotides from about nucleotide 1 
to about nucleotide 1136 shown in Figure 2. 

3 . A synthetic DNA sequence comprising 
the sequence of nucleotides from about nucleotide 183 

10 to about nucleotide 767 shown in Figure 2. 

4. A synthetic DNA sequence encoding a 
mature human IGF- I peptide linked directly at its 
carboxy-end to the amino terminus a peptide comprising 
the following sequence of amino acids: 

15 NH 2 -Arg'Ser Val Arg Ala Gin Arg His Thr Asp Met Pro Lys Thr Gin 

Lys Tyr Gin Pro Pro Ser Thr Asn Lys Asn Thr Lys Ser Gin Arg Arg 
Lys Gly Trp Pro Lys Thr His Pro Gly Gly Glu Gin Lys Glu Gly Thr 
Glu Ala Ser Leu Gin He Arg Gly Lys Lys Lys Glu Gin Arg Arg Glu 
He Gly Ser Arg Asn Ala Glu Cys Arg Gly Lys Lys Gly Lys - C00H. 

20 5. A DNA sequence of Claim 4 comprising 

the sequence of nucleotides from about nucleotide 327 
to about nucleotide 767 shown in Figure 2. 

6 . A synthetic DNA sequence encoding a 
peptide comprising of the following sequence of amino 

2 5 acids: 

NH 2 - Arg Ser Val Arg Ala Gin Arg His Thr Asp Met Pro Lys Thr Gin 
Lys Tyr Gin Pro Pro Ser Thr Asn Lys Asn Thr Lys Ser Gin Arg Arg 
Lys Gly Trp Pro Lys Thr His Pro Gly Gly Glu Gin Lys Glu Gly Thr 
Glu Ala Ser Leu Gin He Arg Gly Lys Lys Lys Glu Gin Arg Arg Glu 
30 He Gly Ser Arg Asn Ala Glu Cys Arg Gly Lys Lys Gly Lys - C00H. 
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7. A DNA sequence of Claim 6 comprising 
the sequence of nucleotides from about nucleotide 
537 to about nucleotide 767 shown in Figure 2. 

8. A synthetic DNA sequence encoding a 
peptide comprising of the following sequence of 
amino acids : 

NH 2 -Tyr Gin Pro Pro Ser Thr Asn Lys Asn Thr Lys Ser GLn Arg Arg 
Lys Gly Trp Pro Lys Thr His Pro Gly Gly Glu Gin Lys Glu Gly Thr 
Glu Ala Ser Leu Gin lie Arg Gly Lys Lys Lys Glu Gin Arg Arg Glu 
lie Gly Ser Arg Asn Ala Glu Cys Arg Gly Lys Lys Gly Lys - C00H. 

9. A DNA sequence of Claim 8 comprising 
the following sequence of nucleotides: 

5 < -TATCAGCCCCCATCTACCAACAAGAACACGAAGTCTCAGAGAAGGAAAGGTTGGCCA 
AAGACACATCCAGGAGGGGAACAGAAGGAGGGGACAGAAGCAAGTCTGCAGATCAGAGGA 
AAGAAGAAAGAGCAGAGGAGGGAGATTGGAAGTAGAATGCTGAATGCAGAGGCAAAAAA 

GGAAAATGA-3' . 

10. An essentially intron-free DNA sequence 
encoding a preproinsulin-like growth factor-I comprising 
the sequence of amino acids shown in Figure 6. 

11. An essentially pure preproinsulin-like 
growth factor-I protein comprising the sequence of 
amino acids shown in Figure 6. 

12. An essentially pure mature human 
insulin-like growth factor-I peptide linked directly 
at its carboxy-end to the amino-terminus of a peptide 
comprising the following sequence of amino acids: 



O £. & » r 



-49- 07-24(401)A 

NH 2 -Arg Ser Val Arg Ala Gin Arg His Thr Asp Met Pro Lys Thr Gin 
Lys Tyr Gin Pro Pro Ser Thr Asn Lys Asn Thr Lys Ser Gin Arg Arg 
Lys Gly Trp Pro Lys Thr His Pro Gly Gly Glu Gin Lys Glu Gly Thr 
Glu Ala Ser Leu Gin He Arg Gly Lys Lys Lys Glu Gin Arg Arg Glu 
He Gly Ser Arg Asn Ala Glu Cys Arg Gly Lys Lys Gly Lys - COOH. 

13. An essentially pure extension peptide 
comprising the following sequence of amino acids: 

NH 2 - Arg Ser Val Arg Ala Gin Arg His Thr Asp Met Pro Lys Thr Gin 
Lys Tyr Gin Pro Pro Ser Thr Asn Lys Asn Thr Lys Ser Gin Arg Arg 
Lys Gly Trp Pro Lys Thr His Pro Gly Gly Glu Gin Lys Glu Gly Thr 
Glu Ala Ser Leu Gin He Arg Gly Lys Lys Lys Glu Gin Arg Arg Glu 
He Gly Ser Arg Asn Ala Glu Cys Arg Gly Lys Lys Gly Lys - COOH. 

14. An essentially pure extension peptide 
comprising the following sequence of amino acids: 

NH 2 - Tyr Gin Pro Pro Ser Thr Asn Lys Asn Thr Lys Ser Gin Arg Arg 
Lys Gly Trp Pro Lys Thr His Pro Gly Gly Glu Gin Lys Glu Gly Thr 
Glu Ala Ser Leu Gin He Arg Gly Lys Lys Lys Glu Gin Arg Arg Glu 
He Gly Ser Arg Asn Ala Glu Cys Arg Gly Lys Lys Gly Lys - COOH. 

15. An es&entially intron-free DNA sequence 
encoding the peptide of Claim 12. 

16. An essentially intron-free DNA sequence 
encoding the extension peptide of Claim 13. 

17. An essentially intron-free DNA sequence 
encoding the extension peptide of Claim 14. 

18. A recombinant DNA vector containing a 
DNA sequence encoding the protein of Claim 11. 

19 . A recombinant DNA vector containing a 
DNA sequence encoding a protein selected from the 
group consisting of the protein of Claim 12, the 
protein of Claim 13 and the protein of Claim 14. 
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20. An essentially pure protein encoded in 
a human messenger RNA molecule which RNA molecular 
hybridizes to a DNA sequence encoding a preproinsulin- 
like growth factor- I protein comprising the amino acid 
sequence shown in Figure 6. 

21. A protein of Claim 20 in which the 
human messenger RNA molecule is selected from the 
group consisting of a RNA molecule 1.7 kb, 3.7 kb and 

6.3 kb in length. 

22. An essentially pure protein encoded in 
a human messenger RNA molecule which messenger RNA 
molecule hybridizes to a DNA sequence encoding a 
peptide comprising the following amino acid sequence: 

NH 2 - Tyr Gin Pro Pro Ser Thr Asn Lys Asn Thr Lys Ser Gin Arg Arg 
15 Lys Gly Trp Pro Lys Thr His Pro Gly Gly Glu Gin Lys Glu Gly Thr 

Glu Ala Ser Leu Gin lie Arg Gly Lys Lys Lys Glu Gin Arg Arg Glu 
He Gly Ser Arg Asn Ala Glu Cys Arg Gly Lys Lys Gly Lys - C00H. 

23. A rotein of Claim 22 in which the human 
messenger RNA molecule is selected from the group 

20 consisting of a RNAmolecule 1.7 kb, 3.7 kb and 6.3 kb 
in length. 

24- An essentially pure human IGF-I gene 
encoding at least two preproinsulin-like growth 

factor- I proteins. 

25. The gene of Claim 24 comprising the 
sequence of nucleotides from about nucleotide 1 to 
about nucleotide 4349 shown in Figure 4. 

26. The gene of Claim 24 encoding a 
preproinsulin-like growth factor- I protein comprising 

30 the sequence of amino acids shown in Figure 6. 
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27. A method for promoting growth in an 
animal comprising administering to the animal a 
therapeutically effective amount of a protein as 
defined in Claim 11. 
5 28. A method for enhancing lactation in a 

cow comprising administering to the cow a therapeu- 
tically effective amount of a protein as defined in 
Claim 11. 

29. A therapeutic composition for 
10 promoting growth comprising a therapeutically 

effective amount of the protein of Claim 11 and a 
pharmaceutically acceptable carrier or diluent. 

30. A therapeutic composition for 
enhancing lactation in a cow comprising a 

15 therapeutically effective amount of the protein of 

Claim 11 and a pharmaceutically acceptable carrier or 
diluent. 

31. An essentially pure DNA molecule 
comprising the sequence of nucleotides in the IGF- I 

20 gene exon 4 shown in Figure 4* 



0229750 




02297 



o 4 


o ft- c 


O CI -J 


o u 


o o • 


O O 3 


— »— 


r* 4 v» 


o • 


u 


(J C 


U -J 


ft- 


o • 


ci * 


u 


4 v» 


O X 


o 


u • 


»— o 




ft- — 


O 3 


ft- 


4 — 


ft- * 


< 


4 • 


o -J 


o u 


o < * 




3» 4 • 


0» 4 -* 


au- 


O • 


— 4 X 


NO < 




o 


O 3 


(3 


O O 


fr— • 


»- 


w *■ 


ci -J 


4 




O Ik 


< 


5i 


4 X 


< 


4 


ft— ft— 


O 


ci 


u • 


o »- 


O O 


O 1- £ 




S 4 


m h- ol 


< 




r* CI 3 


< 


'3 


»- • 


4 


4 


U -J 


■< 


ci 


ft- 4 


ci 




< — 


ft- 


ft- 


o r 


CI 


CI 


(3 <- 


ci 


4 


ci • 


o »- 


o »- 


O ft- vi 


(J 


f- O 


r» ci c 


< " 


— *- 


fM CI • 


4 


ft- 


ft- V* 


< 




CI C 


ft- 


u 


u • 


< 




ft- tn 


4 


3 


(3 ** 


O 


< 


ft- • 


O 


U 


< a 


o »- 


o *- 


O CI (- 


« < 


to »- 


our 




— ft- 


CM 4 ft- 


u 


ft- 


u « 


13 


4 


4 — 


ci 


ci 




O 


ft- 


o — 


fr- 


ft- 




CC 


4 
4 


Si 



o o 
»f> 4 
o 



< 

o 

< 

u 

O 

o < 
^ o 
< 

ci 



4 
< 

O CI 

n O 

ci 
4 

u 



(J 
o 4 

U 



4 

< 1 
4 1 
ft- I 
o 
o o 



o u 
c> 



o 
o 

ft- 
u 

ci 

O ft- 



u 
o o 
n ft- 



13 
o 



o 
ft- 
u 
ft- 
u 
ft- 
o 4 

— u 

— u 



< 



o 4 x 

4 -J 

M O - 

I- • 

o > 

o « 

4 X 

4 -i 
O 3 



r» ft- £ 
ft- a 



3: 



<3 X 
ft- u 
o >- • 
r> ft- C 
r< h- a. 
u « 
O >■ 
ft- U 
u « 
t3 x 
»- o 
(3 « 
o < x 

N < -I 
M ft- • 

ft- £ 

»- a 

< 3 
ft- • 

ft- -J 

< C 

ou3 

- W 1- 

t* u c 

< ft- 

< o 

u c 
u a 

ft- 3 
ft- • 

o ^ 



O ft- £ 

o *- a 
^ >- *- 

< x 

>- 

ft- • 

ft- £ 

ft- a 

<-> > 

o S 3 
ado 
n i3 w 
«< ■< 
u a 

32 



« (3 >• 
« ft- U 
t3 - 

ft- m 

a > 

o • 

t:£ 

(3 C 
O « — 
^ t_> <3 
O ft- J 



ft- • 

u — 
13 < 
*- a 

< * 
o o < 

13 — 
r> ft- • 
O > 
O 3 
ft- * 
u — i 
O 3 

< — 
(3 O 

O ft- « 

cn 



S3 

u « 

o > 
ft- o 
O 3 
O ft- • 
f u J 

not. 

£ 

au a 



»- ■ 

u — 

O £ 



O O • 
P< U — 
O O < 
ft- c 
u • 

ft- VI 

o c 
o • 

< iA 

W b 

our 
— < ft- 
n u v 

ft- £ 

ft- a 

o c 

U £ 

< ft- 

U 3 



O (3 3 ■ 


O < ft- 




o o — 


o < 


o o 


O < 




a 


Q ^ — > t 


O ft- c 




o o o 


o < 


O ft- 


Q < 


o < 


o 


V3 O O * 


9 CI • 






m ci 




O CI 






3 1 
w «* » 


ft— v) 




O w 


13 


o 


o 


•* < 




ft— • 1 


4 o 






S 


ft- 


o 








CI c 




CI s 


< 


13 




< 




(3^1 


O Ok. 




ft— — 


o 










Oct 


CI o 




4 — 




< 


< 


< 






CI c 




O C 








< 




O ^ i 


CI & 




4 *— ' 








< 




O O C i 


O (3 C 




O CJ o 


o o 


o < 


O O 


O ft— 


o 








9 O 3 


Ok 4 


oft o 


at < 


Oi ft— 


a 


V ^ T 1 


M3 3 O 




ft— V 


CI 


n ft— 


ai < 


O ft- 










CI «J 






< 


— > ^ 




t3 ' 1 


5 X 




1— c 








4 






ft— ft— 


* 


O V 


< 




< 


o 






14 « 
w — 








ft— 




4 




3 5 ! 


4 X 






ft- 






4 




ft— • i 


^ j 






5 












3 c 




O 4 






43 






o •— cT 1 


- n ^ w 




O 4 3 


o 3 


o o 


O CI 


O ft— 


o 


9 CI C 1 


v o O 




o o 


•o < 


0 O 


•> o 


m 4 


m 


^ ij I i 






f- o 


S ft— 




a 4 




* v^ t 


CJ £ 




4 c 


3 




< 


mm < 




C 9 1 






CJ £ 




^3 








13 C 1 


3- 




4 ft— 




^> 


o 


ft* 




^ j < 


4 X 




O x 




CI 




4 












3 


o 




4 






yj ? 






3 




4j 


4^> 




^ j 


CI c 




S 3 




3 


(J 








2 ^ 




O 4 — • 






o o 


O CI 


o 


? *• ^ i 


[T "3 ** 






O • 


K ft— 


N O 


ft^ 4 


ft* 


▼ ft— CI 1 


■If"* 








• o 


Oft ft— 


O 4 




O <• 1 






* i ■ ' 










O >> 1 


3 \ 
















ft- CI 1 






w C 












•J J » 








4 X 




^3 


4j 




4 ■ • • 


w — 




o o 




o 




|b 




o o * 


CI £ 






o o 






4J 




ft— CL * 














4j 




2 ^5 ? ! 


O CI * 




o 3 o 


O 4 X 


o o 


O 4 


O CI 




V w ^ l 


(0 < — 




CS O X 


*o 4 J 




<0 4 


<o O 




f o — " • 


ftO CI I 




^ o •— 






0ft — c 


O ft— 




i— • i 


ci a 




W w 


4 X 




43 


— • CI 


_ 


O > i 


o c 




4C X 


^ J 






ft- 




^ ! 


CJ < 




o ~~ 


CI X 


<3 




CI 






o c 




o o 


3 o 


O 




^. 




■r 2 


< — 




^ ? 












o o 






3 ? 


3 


s 






O *— 1 


CI « 














O O O ) 


a u — 




,_, ~ ~ 
w ft— ■ 


O 4 4 


0 ^ 


o o 


o o 




tfl < C ( 


tfl o < 










If) 4 


ftD 4 




V U C 1 


^ ft- 01 




CO CI X 


K o x 


Id 4 


Oft 4 


O CI 






O c 




< u 


ft— CI 






— - ^ 




3 e ! 


u « 




U £ 








(J 






u — 










43 


4 




J J 1 


ft- « 




o * 


O O 




< 






? ! 


o > 






»- « 




43 


4 




w C 1 


ft- c 






1 u - 












u • 




< o 


o < 


< 








O w • * 


O ft- vi 




O U c 


o c 


o o 




O 4 






* c> a 




w u a 




V < 


V CI 


4 




^ O ^ ' 


ftD o c 




3 c 


t— 4 < 


03 CI 


ct < 


O 4 




3 a i 


u < 




< a 


o 


< 


— 4 




Oct 


ft- m 






O c 


< 




CI 




3 c i 


ci - 




ft— X 


< < 


< 


o 


o 




o < 




o * 




< 


< 


3 




< i. 




3 o 


O • 


< 


« 


4 




C» < 1 


CI • 




< « 


4 V) 


3 


ft- 


3 




ft- C 1 


ft- VI 


i 


< 'X 


< X 




u 


4 




o o • i 


o o « 


i 


O < J 




o < 


o o 


O 4 


o 


r> < vn i 


PIOl 




S 3 o 


« 3 


n u 


n 4 


n 


* CI *- » 


10 < J 


i 


« a - 


^* • 




» 


a 4 




O • l 


U fl 


i 


< < 


»- — 


< 


u 


•» 




4 VI t 


CI - 




< a 


^ — 


< 


ft- 


4 




U C t 


o < • 


O c 


O 3 


u 


< 


ft- 




CI • 1 


ft- a 


i 


< < 




o 


3 


ft- 




ft- VI 1 


u L 




o c 




< 




4 




CI > 1 


ci a i 


< — 


8? 


CI 


3 


4 




o - < 


O i 


i 


CI O 




< 


9 < 


4 




o o o * 


o < X I 


O ft- - 


o < < 


o o 


a 4 


o 


CM ft- C 1 


C4 < - 


> i 


<N U • 






(N ^ 


fx 4 




f < XI 


U 3 i 


0 ft- Vt 




m u 


01 ft- 


O ft- 




ft- ft- 1 


ft* • i 


o « 


< < 


o 


ft- 


- — 4 




(3 X 1 


U ^ 1 


< X 

3t 


o c 




ft- 


4 






ci O I 






Cl 


4 




So.! 


O W 1 






u 


4 




< C 1 


ci a i 


CI £ 




3 




4 




CI £ 1 


< « t 


< ft- 


35 

a < # 




< 


< 




4 ft- 1 


U — 1 


Cl c 


o3 


ft- 


4 




<3 CI O 1 


o o < » 


2 * • 




oo 


O 4 


o 


- CI C 1 


-Oil 


« 3 • 

< X 


- < X 




— < 


— 4 




<r o a i 


vj O > i 


-3i 


S3 


•3 


a < 




o * I 


ft- u I 








— 4 




< X 1 


w k 1 


< -J 


< > 


3 


< 


3 




< -j i 


< X 1 


CI c 






ft- 






U C ( 


ft- ft- 1 


4 • 




ft- 


< 


ft- 




< « t 


o - i 


4 < 


4 X 




< 


4 




< < 1 


ft- • 1 


o c 




5 


3 


4 




U « 1 


< a i 


CI £ 


< X 






4 





to 

\1 



z — 
X - 



x — 
z 



22 

. o 
5 



-4 ,i 



x 



l'l 



§ ■ [II 

I— t — 1 1 



it* 



0229750 



0 i 
1 — *< — 



2 — 



• ••J 



ft I 



022975b 



< U 

o u 

0 < 



< (J 

o < 

f- u 

o < 

0 o 

u < 

< o 

< < 



* u 
u < 

«-» 

0 < 

< < 

< o 

0 



13 O 
o «< 

o o 



o «< 

< o 

(J 

u O 

o »- 

< o 

O I- 

o o 

< w 

0 H- 

U CI 

0 

o o 

U «< 

0 u 

0 < 

*- o 

H < 

o o 

»- 0 

< o 

0 H- 

o o 

< H- 

13 O 

0 < 

H O 

•< 

u o 

O <: 

o « 

»- ^ 

0 < 

0 o 

< < 



o 


o 




m 






< 


< 








< 




u 




u 




t- 




(J 




u 


0 






(_) 


>•* 


0 










^> 


{J 


(J 


o 


{J 


^_ 


0 


(J 




13 






0 




0 








0 












r* 




0 




•X 




0 




^ 


0 


o 


0 


0 


0 


0 




0 












1*1 






o 


o 


u 


w 


o 


C5 


»- 


o 


0 


o 


< 


0 




>- 




0 


0 


0 


< 


*- 


0 






4t 


< 


0 




< 


0 




< 


(J 


< 


0 





u 

< 
< 
< 



h- < 

o < 

H- 0 

t- < 

U 0 

O 0 

O << 

< 0 

I- 0 

0 0 

O >- 

< < 

CJ K 

U> O 

O U 

O 0 

u »- 

0 K 



« O 

< I- H 
0 0 0 

< »- < 
O *- »- 

< < < 

o < u 

I- < 0 

< < 0 

5 3 S 

< < u 

< u o 

o < < 

< < o 

o < 

< H 

< »- 

< t- 
O »- 

O 

»- O 

< 0 

< >- 

< K 

< 0 
< 
U 



t3 

0 
0 



0 



0 
0 
0 



< 

o 
>- 

0 
•< 

O 

o 
u 



u 



u 
I- 

0 

o 

0 

»- 

0 
< 

0 
U 
U 
U 



0 
0 
0 
0 

< < 

< < 

< 

0 
0 
0 

U 
< 
< 

o 

0 

0 



3 t: 3 



< 0 »- h- 

u o < < 

0 ^- 

0 U H O 

0 0 i- 

< < < K 

< U 0 O 
U <J ^ i- 
^ 0 < 
O < »- < 
t- 0 < 
I- (J 0 < 

< J < < 
4 »- O O 
0 < < 



0 
< ' 
U 

o 
u 

< 



0 



0 
< 



0 
0 
< 



0 
0 



o 


o 


O 


N 










a 


O 


< 


u 


u 


< 


u 


< 


< 


w 


0 




t- 


< 




0 






»- 


0 


0 


< 


< 




< 




< 


< 




< 


(J 


< 




o 




H> 


0 






< 






< 


0 


(J 


























0 


0 






if 




0 


^. 




























13 






(3 










0 






^» 






































t3 










0 
































































































0 








^3 


< 


< 


< 


< 




< 


< 


< 


< 




O 


< 


u 


0 


< 




< 


< 


< 


O 


(J 


< 


0 


U 


0 


0 


U 


< 


>- 


0 




< 


>- 


u 




< 




u 


u 


U 




o 


0 


h- 


< 


>- 




u 




»- 


»- 


>- 


< 


0 



o 
o 



U 
U 
0 
< 
0 
< 



0 
< 
U 

0 
< 

a 
< 

0 

< 
< 
< 

0 
0 
»- 
U 
< 

0 

< 

u 

»- 

o 
< 

< • 

>- • 

u 

0 



< 

0 
0 



< 



O 



< H 

0 O 



< 

0 

< K 

»- < < 

>- < 

fr- 0 0 

0 < 0 

»- 0 < 

^ >- < 

H < 

H- < U 

K 0 

H H H 

H 0 < 

0 < < 

H < < 

^ * 

t- e i- 

U < H 

H o 

K 0 H 

H < O 

O U < 

U " U K 

>- u u 

0 0 H 

U ^- U 

»- < 0 

< 0 W 

>- 0 



U 0 

0 0 

3 0 

0 »- 



0 
»- 
U 



u 

0 



< 
< 

»- 

o 
>- 

u 

»- 
H- 
>- 
< 
0 
0 
< 
0 



< 

0 
< 



0 

0 



0 
< 
U 
< 



0 



< 

0 



< 

0 
< 

0 



4 



u 

0 



U 
U 



< 

0 
H 
^~ 
U 
0 



0 
0 
»- 
0 
H 
0 

U 

< 

0 
U 
0 



< 

< 

u 
»- 

0 

0 
0 



a 



< 

0 • 



< 

< 

0 



a 

0 • 

< >» 



0 • 

>- c 
k a. 
h a 
« « 

0 < 
K « 

0 >» 
U 

V- • 

K £ 
a 

o m 

0 

y- u 
0 «• 

o x 
0 

0 - 

< >> 

< -I 

m 

>- c 

a. 

< 3 



< C 



< 
< 



0 

u 

0 

< 
< 

< 

0 
O 



u 

0 



a a 




H • 1 




• 9 


m a 


< — 


H XL 1 


« 


• fl 




u X 


H a. i 


u 


• u 


a a 


0 I 


0 >» 1 


a 


• 9 


■ u 


U • 


0 - 1 


9 




u ~ 




0 0 t 




• 


m a 


U L. 


0 9 t 


u 


- m 


o m 


0 • 


0 L> 1 




• u 


u u 




< < 1 


« 


• • 


• 


u c. 


0 a 1 




• 9 


« a 


o • 


« • t 


9 


* 0 




)- i/t 


0 < 1 


u 


• 0) 


• « 


0 


< >» 1 


u 


• 






0 ~ 1 






C 


^ «n 


0 0 1 




• 9 


W ^ 


0 c 


■ 1 




• 0) 


u A 


0 £ 


0 V * 


01 


• 0 




^ ^. 


^_ 0 1 




• 3 




u • 


0 — 1 




• « 


u o 




H • 1 






W CJ 


0 X 


0 > 1 




a 


y ^ 


0 «•* 


0 • 1 




■ 


• O 


^ % 


t» £ | 


3 




Q • 


« a 


H CL 1 




A U 


Q 




0 C 1 






■ O 




^ *_ 1 




*• 


3 3 




O 0 1 




4^ 




0 ~~ 


H 3 * 




a 






^ # 1 




A 




0 > 


U j | 






CI o 


01 


f» • 1 


3 


3 






0 *~ I 




* a 






*<1 ^ 1 
w * 1 


01 


• 3 










* ^ 




{J 


*t & 1 


3 








0 < 1 




• 






0 <— 1 


3 


• • 


■ o 




• 1 


■ 


• o 


ft o 


(J 


0 > 1 


3 




• Q 




0 3 1 


3 


% m 


o ** 




H- • 1 


3 


• m 






U _j | 


. 9 


** a 






0 3 i 




O 9 


# ^ 




< 1 


3 


44 ■ 






0 0 1 


y 


• u 


* ** 




^ V 1 




a 4^ 




Tj 






• a 


tj ii 




0 < 1 




u a 






0 >» 1 




a a 


9 O 




0 v 1 


|g 


4^ a 


A 




0 0 < 






O 9) 




0 • 1 




«# a 


m H 




w i^T I 














3 || 






0 3 t 




9 4* 


il 




^ • 1 


3 


0 4| 


* ^ 




0 J 1 




o a 


• CI 




0 i* 1 


3 


a 4 


Q 3 


• 


II 1 
VJ £ 1 




a 4 


** O 




! 


y 




** ** 


■ 


0 3 i 


*^ 








^ J 


5i 


* 


O o 


a 


0 0 1 


9 




** 




0 O > 






♦* u 


a 


0 L> 1 


U 


a 4* 


• u 


o 


0 & 1 






■ u 


• 


< > < 


** 


9 U 


u 


M 


0 — t 


9 


U 9 


♦* u 




0 0 1 


*f 




U 4 




• — - 




a *■* 


■** ft 


• 


0 ~ ~ 




** u 


o o 


o 


0 ^ 




** 


*• • 


9 


0 W 






*4 U 




0 C 




• 










9 U 


•# 




0 




^ • 






0 *~ 


3 


9 9 






0 ^ 






M O 








4J« 9 


Ok ^ 




0 S 


y 


■) 9 






^ VI 


w 


u a 


*• o 




0 L 


(j 


a u 




o 


0 • 


a 


* u 






< 






u 


u 


0 k» 


u 


u 




9] 


U £ 


a 


a ■ 


a a 


u 


< K 




m a 


u • 


u 


0 • 


o 


• • 


• a 


u 


>- £ 


' 9 




* u 


u 








4 U 


9 


0 I 


a 


44 «* 


a 


u 


0 £ 




u a 


9 


(J 


< t» 




u a 


m m 


u 


0 3 


• 


• 4rf 


** 9 




h- • 


. u 


a a 


a a 




0 J 


«# 


u +* 


U M 




0 3 


u 


~ a 


u 


u 


H- • 




u fl 




** 


O J 


u 


u a 


•V *« 




0 • 


u 


44 «4 




u 


o > 


u 




m a 


« 


0 


w 


« « 




u 


O 3 


U 


a *- 


- « 


a 


• 


o 


• u 


u 


w 




u 


a a 


*- « 


• 


0 ■ 


9 


44 *« 


u o 




u - 


m 


u a 


— « 




0 < 




** « 


• m 




0 3 


9 


u ** 


u « . 


9 


l- • 


m 


U f 


a - 


« 


U -J 


m 


a a 


« « 


9 


O U 




a « 


A* 1 




£t 


9 


* a 


• 3* 

* "31 




< < 
0 • 


i ** 
• o m 


•> 


9 




»- C 


t u « 


* "?» 


O 




r- a 


i 9 




4* 




>- w 




"J 


«« 


Si 







»- I. I 

4f X I 

3^: 

0 0 I 

< t» * 

0 £ I 

« h- I 

U O I 

0 t. I 

0 a i 

0 « I 

•t > t 

< -j i 
0 c t 

< * i 

9 



u 
• 



9 
* 

u 

9 

m 
« 

9 
4 



9 
9 

-9 
9 
9 



9 

U 



a 

a 

9 

a 

9 

** 
a 

o 
a 



o 
m 

a 



9 

a 
« 

9 
<44 

2 



0 a 

0 c 
0 < ■ 

K • 

0 — 
0 < 

< <- 
0 • 
>- V» 
0 • 

< >• 

< J 
0 « 
0 — 
0 < 
H O 

o t> 
0 o. 

0 • 



0 3 
I— • 

0 -I 
0 O 
0 «- 

u a 

< • 

u — 

0 < 

U • 

0 

W 0 

< 

0 

• 

< a 

0 3 

< — 

0 3 



o a 

0 u 



0 9 
0 (- 



0 ^ 

t- a 



o x 

H- 0 
0 U 
0 • 
< VI 
0 9 
0 L. 
0 «< 
0 • 
£ 

k a. 
0 m 

0 

O 
0 « 

0 X 



S3 



U 

u 

44 
« 

a 
a 
a 



o 
u 
a 



to 



u 

44 

a 



u 
a 



u 
a 



u 
u 
u 



a 
a 

44 

u 
9 

44 

a 



a 

9 

a 

4* 

a 
a 



u 

u 



5 

n3 



o 

»44 

a 

44 

a 
m 

44 

a 
a 
a 
a 



a 

0 ■ 

< > 

< J 
0 C 

< — 
0 0 
0 U 
0 £ 

< K 
0 • 
« X 

< -J 
0 o 
0 c 
0 a 

0 ** 



< • « 

0 < « 

0 C 44 

0 £ U 

< 44 
O * *- 

< — •* 

or 9 

0 9 

0 i. - 

0 < O 

0 C 44 

< — u 

0 0 O 

0 « 9 
0 — 
0 < 
9 

0 U 
0 < 



6* ■ 



* 

9 



0229750 



^ Si 

I U 13 









r- 


a» 




co 


f- 




o * 


>— • 


< 


< >• 




CJ 


< -J 




< 


u c 






< • 


^ — 




< < 


13 O 


< 


O 


o o 




u r 


o «- 




< v- 


< 


13 


K e 


o a 


< 


o • 




o 


»- «1 




(3 


< o 


o c 


< 


CJ C. 


< — 


< 


o o. 


o o 


< 


(J O 


O 3 


< 


u c 


-< — 




CJ CL 


a c 


>- 


13 C 


•* • 


< 


< — 






u 13 


< _j 


o 


H- u 


o • 


<< 


< >, 


«< > 


<. 


*- *~ 






a 


a • 




m 


■< >. 


-< 


u 


•t -j 


u 


m 


•< >■ 


< 


4» 


o - 


13 




o o 


■< 


+0 


< a 


(J 


a 


(3 t 


13 


** 


< < 


< 


« 




(3 


a 


1— — 


a 


** 


<. ~ 


< 


a 


13 C 


a 


** 


< — 


< 


+* 


u o 




** 


a 3 


< 




f- • 






U *J 




« 


v- t_ 




** 


0 • 


o 


** 


-< *rt 




■ 


< fl 


< 


• 


cj — 


a 


u 


0 < 


o 


a 








— 


< 


« 




u 


« 


<: c, 


(3 


y 


U JC 


o 


*# 


* H- 


•< 


u 


C!5 > 




** 


13 — 




** 






a 


L3 3 


< 


** 


■< — 


a 




o a 






u « 




• 


«< >■ 


<< 


u 


< -j 


< 


u 


a c 


< 


« 


< — 




■ 


CJ C3 




a 


< 3 


< 


a 


< — 


O 


a 


O O 


o 




o >. 




■ 


o - 




u 


o a 


13 


u 


< >» 




• 


C3 — 


u 


ci 


o o 


•< 


« 


< o 


o 




cj u 


a 




< 





^> 




M A 












n 


10 




v (9 O 










n m v 


o 


*~ 




M r*S S 










m n 


rj 


CO 




O O C3 










▼ ♦ f 




r* 


™ 












ci a 








* o ** 


>j 








o ** 








* ? ^ 




13 




*^ 








• 


* ™ ** 










o a 


C3 




3 


* 










O fl 




(3 


9 


***** 








J' 










• CI fl 


CJ 










< 




U 


• fl a 




CJ 




™ 


CI fl 








• ** a 










?i *? 


< 


< 




• a ** 


CJ 


CJ 




a 


CI fl 


< 






• • • 


CJ 


o 




a 


CI fl 


(3 


< 


♦* 


• a a 


CJ 




c3 


** 


O fl 


< 


< 


u 


• • fl 








Cl 


fl o 


< 


< 


a 


• fl ** 


o 






a 


+* u 


< 




*« 


• a a 




13 


•t 


4* 


fl fl 


< 




■ 


- a ■ 


< 


CJ 


K 


o 


♦* 4* 


13 


< 


• 


a a 


13 


S 


CJ 


a 


c* a 


O 


< 


■ 


a a 


O • 


13 


H- 


a 


4* 






u 


• m 


< • 


13 


V- 




a 4* 


< 




u 


J3 ** ** «a 






13 




a ** 


< 




o 


JC ** 


c3 *- 


< 


>- 


u 


** a 


< 


< 


u 


a 


K • 


O 


•< 


♦* 


4* U 


< 




a 


tn a 


< 31 


< 


< 


fl 


a a 


O 


< 


• 


— a fl 


o a. 






fl 


** a 


< 


< 




fl fl 


(3 c 


< 


< 


a 


40 40 


o 


•— 


*« 


a u 


< < 


■< 






u a 


o 


o 


m 


• o 


CJ u 




CJ 


o 




*- 






* a u 


< >■ 






** 


■ CJ 


< 






• fl a 








CI 


• 




o 


• 


- a a 


CJ c 






** 


40 M 


< 


o 




• +* a 


•< • 


CJ 




o 


fl (J 


< 






« o a 


*t ^ 






u 


a o 


< 






• u 


13 fl 


< 




♦* 






K- 




- a ** 


< > 




CJ 


•** 


O 40 


< 




* 


- a 




< 




fl 


40 U 


< 




a 


• • a 


CJ c 






fl 


a a 






* 


u a fl 


< • 




o 


fl 


** A 


a 


< 




o • ■ 


< < 




V- 


4* 


a a 




1- 




a a a 




>- 




o 


fl u 


<j 


U 


** 


a a o 


o — 


13 


13 


fl 


U fl 


>- 




a 


• • ** 


13 13 


< 




CJ 


* u 


u 


u 


+* 


44 ■ 


< m 


< 


CJ 


a 


u 


1- 


•< 


m 


** m o 


CJ — 


>— 


C3 




w CJ 




u 




U fl 


13 < 


< 




4* 


■ 


< 


•c 


m 


a « a 


h- C 


< 




a 


** fl 


(3 


(3 


** 


Of « 


13 • 


< 




a 


a 


< 


«< 


• 


a a « 






13 


u 


*0 u 




< 


• 


« a** 


S'i 


< 




o 


♦* 


< 




u 


u a a 


C3 — 


■< 




4* 


40 40 




< 


*• 


u « u 


13 13 


CJ 


13 


■ 


a ■ 


1- 




a 


*- o a 


< a 


CJ 


O 


a 


** fl 




< 


** 


a o a 


13 U 


< 




a 


CI u 




< 


a 


• « • 


< 


>- 




4* 


a ci 


U 


u 




« o • 


*- c 


CJ 


o 


fl 


a fl 


U 


o 


o 


u o +> 


13 • 


CJ 


< 




a ci 


< 


o 


u 


a a a 


< *rt 


< 


13 


a 


CI fl 


• < 


< 


a 


• u a 


< fl 


CJ 


C3 


-+* 


** u 


»- 


o 




m ■*+ 


o — 


< 




** 


u a 




< 




40 U O 


O < 


< 


CJ 


a 


• fl 


< 






U *• fl 


CJ c 


13 


CJ 




fl ■ 


< 


< 


• 


*rf ** fl 


< fl 


13 


f- 


a 


ci a 




-< 


a 


•** « ci «e < < 


»- 


o 


a 


a « 


< 


>- 


■ 






O 


a 


o « 




<<. 


« 


■ a u 


• -< >» 








• u 


< 




a 


o a a 


< -i 


CJ 




fl 


** 40 


< 




a 


a*- « 


13 3 


< 


< 


u 


u a 


< 




♦« 


♦* a 


• 


< 


< 


u 


** u 




< 


fl 


o *• ** 


>- -J 




< 


a 


4* 40 






4^ 


a o « 


fl 




< 


•** 


4* a 




-< 




a ♦* ** 


< — 


H- 


CJ 


a 


u « 


< 


< 


a 


« a u 


u Z 


O 


13 


« 


*- a 


«3 






a** ** 


< — 


»- 


t- 


fl 


4* U 




< 


u 


a u u 


»- « 


u 


H- 


u 


u a 


<3 


< 




a 4* 


13 > 


u 


CJ 


■ 


• « 


»- 




fl 


** m o 


< 3 


< 


CJ 


CI 


u o 




< 


■ 


• fl a 


< — 


>- 


< 


u 


a a 


5 






u ■ * 


C3 O 


»- 


CJ 


a 


40 m 


>- 


< 


4* 


** *j # - 


4 a 


13 


O 


• 


40 u 



-T5 
c 
o 



u 
a 



fl 

u 
a 
a 
a 

40 

(J 

u 
a 
■ 

a 
a 



fl 

o 



< c- 

CJ c 

< t- 

13 « 

< X 

< -1 

< o 

CJ l> 

CJ CL 

o a 

C3 l 
>- H 
>- > 
13 - 
O C3 

< fl 

< > 

< -I 

o a 
13 I 

< < 

< o 

O C 

< < 

13 C 

< — 
CJ 13 
*- \~ 
CJ • 

V- ^ 

13 fl 

< x 

< J 

c* u 

CJ c 

CJ c 

< < 



< ■ 

< -I 

-< V 

13 — 

C3 C 



< • 

•< >> 

< -l 
cj > 
13 - 
o o 

< a 

13 U 

< < ■ 
CJ • 

' O > 

t CJ 

I o o 

[~ CJ - 
^C3 < 
I- c 

< fl 

< < 

C3 u 

< < 



o - 
O v3 



13 








40 40 4* 




o 


>- 


• 


< 


< 


40 




m « o 




< 


3 




13 




40 




o a a 




CJ 






< 


o 


40 




a « 




C3 


¥- 


u 




>- 


4* 




o a fl 


fl 




13 


40 


CJ 


< 


• 




u ** o 


*• 


CJ 


3 


CI 


13 


< 


■ 




o a u 




»- 




u 




< 


« 




40 4* • 


*0 


CJ 




a 


3 


< 


4* 




** fl 


U 


O 


< 




O 


< 


+0 




0 


U 




1- 


CI 


1- 


u 


40 




*- U fl 


♦* 




V- 


Cl 


CJ 


13 


+0 




ci « a 


CI 


»- 




CI 


13 


C3 


■ 4* 




u -m a 


40 


CJ 


CJ 


4* 


1- 


>— 


fl 




Cl CI « 


00 


CJ 


I- 


a 


U 


13 


u 




40 (J 


40 




O 


< 


CJ 


< 


a 




u 




< 


»— 






CJ 


40 




« « O 


4* 


O 


K 




3 


< 


fl 




a fl ♦* 


CI 


13 


-< 


CJ 


< 




40 




fl O 40 


• 


< 


CJ 




*- 




■ 




o o 


ci 


13 


< 


< 


C3 


< 


40 




u • u 


40 


CJ 


< 




< 




u 




CI fl CI 


a 


CJ 


CJ 


< 


< 


C3 






** a a 


40 


< 


CJ 




< 


< 


♦* 






a 


u 


»- 




< 


< 


u 




a • « 


« 


CJ 


»- 




*3 


< 


a 




a a u 




13 


< 


< 


< 


CJ 


a 




« « a 




»- 




1- 




13 


. « 




a u ~ 


a 


< 


< 


< 


3 


CJ 


fl 




• a a 


a 




< 






13 


*0 




9 .** 


« 


< 


< 


< 


< 


< 






fl « U 




o 






< 




< 


< 


u a u 


*0 


»- 




< 


< 




< 


< 


« « 


« 


o 


< 






C3 


< 


> 


a *« u 




< 


< 




s 


< 


< 




*- « o 






CJ 




< 




< 


• 


a a - 


■4* 


< 


< 


I 






»— 






O 








< 


b 


< 




«*- a- 


fl 




< 








< 




■ — u 


fl 


>- 






< 


13 


< 




4 U U 


fl 


Id 


< 


< 



U fl 

u u 

40 40 
4* Cl 

*- Cl 

a** 
** u 

Cl 

« a 



a ** ** 

fl o 

40 m a 

^ fl fl 

** a a 

a u a 

a ci ci 

40 a « 

940 u 

• ** fl 

fl 40 a 

* « 

O 40 40 
40 fl 

a ** 

40 fl 

Cl Cl ** 

U Cl •* 

fl fl ** 

« 40 

U *0 • 

a fl ci 

a u a 

<j *# ** 



022975 

FIGURE 6 , 



10 20 



-Met-Gly-L78-Il e -Ser-S e r-L e u-Pro-Thr-Gln-Leu-Phe-Ly»-Cys-Cys-Phe-Cys-Asp-Phe-Leu 



30 AO 



Lys-Val-Lys-Met-His-Thr-Met-Ser-Ser-Ser-His-Leu-Phe-Tyr-Leu-Ala-Leu-Cys-Leu-Leu 

50 60 
Thr-Phe-Thr-Ser-Ser-Ala-Thr-Ala-Gly-Pro-Glu-Thr-Leu-Cys-Gly-Ala-Glu-Leu-Val-Asp 

70 80 
Ala-Leu-Gln-Phe-Val-Cys-Gly-Asp-Arg-Gly-Phe-Tyr-Phe-Asn-Lys-Pro-Thr-Gly-Tyr-Gly- 

90 100 
Ser-Ser-Ser-Arg-Arg-Ala-Pro-Gln-Thr-Gly-Ile-Val-Asp-Glu-Cys-Cys-Phe-Arg-Ser-Cys- 

110 120 
Asp-Leu-Arg-Arg-Leu-Glu-Met-Tyr-Gys-Ala-Pro-Leu-Lys-Pro-Ala-Lys-Ser-Ala-Arg-Ser- 

130 140 
Val-Arg-Ala-Gln-Arg-His-Thr-Asp-Met-Pro-Lys-Thr-Gln-Lys-^r-Gih-ProiPrp-Ser-Thr- 

150 160 

Aim-Lys-Aj;n-Thr-Lys^ 

"0 180 
Gln-Lys-Glu-Gly-Thr-Glu-Ala-Ser-Leu-Gln-Ile-Arg-Gly-Lys-Lys-Lys-Glu-Gln-Arg-Arg- 

190 

Glu-Ile-Gly-Ser-Arg-Asn-Ala-Glu-Cys-Arg-Gly-Lys-Lys-Gly-Lys-COOH 



"3 



0229750 



o 

U 
h- 
OU 

— >— 

- < 

o 
u 
cj 



o u 

o < 



cj 
cj 



o < 
a cj 

3 

O 

< 
< 

o3 



cj 
a o 



3 

o o 

IP h- 
< 
< 

o 
o 
o 
o 
>- 
CJ 
o u 
» o 



3 

oo 

^ •< 

o 



a >- 

« cj 

>- 

cj 
< 
cj 

o 

o cj 
CM < 
cj 



o cj 
- o 



t 



3:3 = 

r> »- - w 
CM H - H- 



cj 
O 
h- 
U 
O 
O h- 
«n O 
cm < 
< 



3 :S 
3:3 



cm cj 
CJ 
cj 



O 
cj 

o »- 

o u 

cm w 
O 
< 
cj 
o 
< 

0 < 

01 < 

— < 

< 

a 
o 
o 

< 
O < 

CO CJ 

— O 



C3 
< 



o u 
^ < 



O 
o < 
«© cj 



< 

u 
>- 
>- 
o < 
m < 
— u 
u 
< 



cj 
o 

»- 

o cj 

M" CJ 



<n cj 



U 

cj 
I- 



' cj 
' o o 
• a 
. u — 

> a 

- < 



• cj 



. u o 

' CJ CO 



' cj 

• »— 

' O 

• < 

' o 

• C3 

• < a 
> CJ 



. o a 
. H CO 

■ < 

■ o 
•O 

■ < 

• < 

■ o 
» < 

■ CJ O 

• t- IO 



. CJ o 

:3" 



• CJ O 



» < 
» < 

► o 



> o o 
. o cm 



■ < o 

. u — 



u 
o 
o < 

kO C3 

cn k 

CJ 



u 
o 

O K 
^ CJ 
CJ K 
CJ 

O 

CJ 



o 
r> 
u c« 

■ O 

• < 

• o 

• ¥* 

• CJ 

:S 

f o o 

> O cm 

> CJ CM 

• O 

• K 

> CJ 
- *- 

■ CJ 

■O 

»CJ 



3=32 



C3 
O O 
rj o 
ro CJ 
< 
O 

o 

U 

o 

c5 
O CJ 

« < 

« CJ 

u 
o 



CJ 

o 
o < 

— CJ 
P9 U 



CJ 
CJ 

< 

o u 
o 

en u 

C3 

CJ 
CJ 

o 



01 O 
o* O 
CJ 

o 

>- 
CJ 

CJ 

O l- 
» CJ 
»- 

CJ 

CJ 

CJ 

o o 
r» cj 
IN >- 
u 

CJ 

I- 
u 

CJ 

o 

O I- 
CD < 

r« cj 

CJ 

< 

CJ 

< 

CJ 

O 
»- 
o < 
m c3 
< 



» < CM 

- o 

■ o 

► u 

• o 

• < 

• o 

- o 

■ h- a 

• cj a 
. o « 

• o 

- CJ 

• < 

• CJ 

• CJ 

-O 

• >- 

• u o 

■ k a 

. CJ — 

• < 

• CJ 

• u 

■ < 

• CJ 

■ K 

► K O 

■ cj m 

• cj - 

• < 

► CJ 

• *- 

' u 

, o 

' K 

• CJ 

• CJ O 



• CJ 

• o 

• CJ 

• o 

• o 

■ o 

> u « 

> o — 

• < 



. CJ o 
' < — 

> CJ 

- o 

• CJ 

- K 

' u 

• u 



- CJ o u 



• o 

■ K 

■ < 

• CJ 

■ CJ 

■ < 

■ CJ 

• < o 

- CJ « 

' o - 



:3 



3 

u 

C 

23 

< 

3 

< 

3 

< 

O < 

23 

C3 

3 



a a 



< 

<; 
O CJ 
CD (5 

« < 

3 

< 

CJ 
< 

23 



cj 
O < 

o 



m cj 



CJ 
< 

o 
< 
o < 

« CJ 
CJ 



3 



> C3 

3 

CJ 

■ «c 

> CJ 

■ C3 



CJ 
CJ 

• < 

. o o 
. o e 

• « IO 
CJ 

o 

• CJ 
CJ 

• < 

CJ 
CJ 

C3 O 
to 
< to 

• CJ 

< 

< 

• a 

• < o 
■ •< * 
» C3 ia 

»- 

a 



u m 

CJ 
I- 
CJ 
CJ 
CJ 



• < 

- ^ o 
■ O cm 

• O *D 



< o 

CJ — 

• < » 

CJ 



CJ o 

< a 

< IO 



o < 
« < 



< 

CJ 

• a 
*- 
a o 



o < 

tN CJ 

CJ 

f- 

c9 
< 



3 =3 
3 "2 

c5 



> CJ 

a 
u 
• < 



3 H3 



o 



u •• u o 



o 
a < 

- CJ 



4 

u 







< 


- < 








< 


- < 


< 


< 


h- 


< 


- < 




< 






- < 


< 


u 


O 




< 




C3 






- < 


U 


< 




c? 




o w 


u 




o 5 


H 


to u 




C3 


^ »- 


CJ o 


O CJ 




-oo 


at < 


K O 




< 




u 


< «3 


o 


u 




< 










< 


- < 


u 


< 




< 






C3 


CJ 


3 


< 


CJ 


3 




< 






< 




< 


- < 


O K 




- o 


O h» 




kO < 


< 




<n cj 


« Q 


O H- 


3 


k a 


a» o 




— CJ 




- < • 


CJ 




< 




cj m 






CJ 




< 


CJ 


< 


< 


< 


- < 






CJ 


3 


CJ 




- < 






u 


< 


l- 


3 






C9 


< 


o < 






o < 








< 




< O 


o < 




CJ o 


Ck < 


- < CO 


^ < 


< 


- < r- 






< 


CJ 


< 0 


f- 


< 


< 


c? 


< 


*- 


< 


< 


s 




H 


< 


CJ 


< 


CJ 


U 


< ■ 


o 






CJ 


< 


s 


< 


- < 


< 


- < 


o < 


CJ 


— o 


o < 




r> o 




— < 




< o 


o < 


o 


CJ o 




< r* 


< 


< 


- < « 


< 




< 


CJ 


I- « 




- < 


< 


< 


< 










— < 


< 


- < 


< 


< 


- < 








f 




< 


- < 


H- 


t 




< 




O < 


1 


< 


o < 


u 


CM < 


»- 


< 


o a 


H O 


O < 




cj a 


<* ts 


>- CO 


^ < 


< 


CJ «3 




O r* 


< 




u « 


< 






o 


CJ 


3 


< 


< 


< 


CJ 




< 






i- 


o 




<< 


s 


»- 




< 


< 


< 




< 


— < 


a < 


CJ 


a 


o o 


CJ 


< 




CJ 




- h- O 


o < 




- o o 


• < 


«• < to 


«- < 


C3 


— c3 * 


3 






< 






- H* 




< 




< 


H- 




o 


< 


3 


1- 


< 




o 


< 


CJ 


c> 


< 






- K 




< 


- < 


o 


< 


o < 


< 


— < 


o o 




o < 


< 


- < 






o < 


o 




« o 








»- 


C5 


-OK 


< 


< 


1- « 


>- 


- H- 


u 


< 


— -< 


u 


- CJ 


o 


< 


CJ 


o 


- o 


3 


< 


- < 




- >- 




< 


- < 


CJ 




o 


< 


>- 


<J 


1 


< 


CJ 


< 


o o 




a cj 


O 








01 < 




»- o 


« < 




Ck O 




^- « 






< 


3 




C9 




< 


< 


o 


< 






CJ 




< 






O 


< 






< 






< 




< 


< 


- < 


o 






3 


- < 


o < 

S3 


"3 




»- 






< O 


at cj 


U 


< — 




-HO 


O 




< «0 


< 


3" 


< 




CJ 


< 




< 



3 

< 

O 



CJ 
CJ 



Hi' 



3 

< 

o 

3 



< o 

< 41 

o 

CJ 
CJ 
M 



< 

o o 



< 
< 

o < 

S3 

u 
. c* 
< 
< 
< 
< 

o c 



: 3 

CJ CM 

u rs. 

3 

• < 
■ < - 

. CJ r*. 
. O 

CJ 



15 g 



3 

< 



o 

< 

»- 
o 

s: 

< 



< 
< 

S3 



< K 
< 



O < 



< 
o < 



o < 
— < 



< 

< 
o < 
o < 



o < 
0> < 
o < 



3 

o < 
o < 

< 
< 



O -4 

o < 




European Patent 
Office 



EUROPEAN SEARCH REPORT 



Pace 2 

D229750 

Application Number 



EP 87 87 0001 




Y,P 



X,P 



EP-A-0 216 742 (MONSANTO CO.) 

* Page 5: "Uses of the peptides"; 
claims 4-6,14-15,20-22 * 

THE JOURNAL OF BIOLOGICAL CHEMISTRY, 
vol. 261, no. 11, 15th April 1986, 
pages 4828-4832, The American Society 
of Biological Chemists Inc.; P. ROTWEIN 
et al: "Oraanization and sequence of 
the human insulin-like growth factor I 
gene" 

* Whole article * 



27-30 



24-26, 
31 



TECHNICAL FIELDS 
SEARCHED (Int. CI.4) 



The present search report has been drawn up for all claims 

Date of completion of the search 



Place of search 

THE HAGUE 



05-07-1989 



CASTRO Y GONZALEZ R.F. 



CATEGORY OF CITED DOCUMENTS 

X : particularl\ relevant if taken alone 

V : particularly relevant if combined with another 

document of the same category 
A : technological background 
O : non-written disclosure 
P : intermediate document 



T : iheon or principle underlying the invention 
E : earlie'r pateni document, but published on, or 

after the filing date 
D : document cited in the application 
L : document cited for other reasons 

dTmemberof the "same patent fanitiv. corresponding 
document 



i 




v 




Europaisches Patentarfit ]f 

European Patent Office © Publication number: Q 229 750 

Office europeen des brevets A3 



© EUROPEAN PATENT APPL8GA" 



3 HO 



© Application number: 87870001.2 - @lnt.Ci. 3 :C 12 W 15/00 

©D-foffmng: 06.0L87 C 12 P 21/02 ' A 61 K 3 ™2 



© Priority: 07.01.86 US 816662 
20.11.86 US 929671 



(43) Date of publication of application: 
22.07.87 Bulletin 87/30 

© Date of deferred publication of search report: 25.10.8S 

© Designated Contracting States: 

AT BE CH DE ES FR GB GR IT L! LU NL SE 



© Applicant: WASHINGTON UNIVERSITY 
Lindell and Skinker Boulevards 
St. Louis Missouri 63130{US) 

(72) Inventor: Krivi,GwenGrabowski 
9400 Harvest Court 
St. Louis Missouri 63132{US) 

@ Inventor: Rotwein, Peter Scott 
19 N.Taylor 

St. Louis Missouri 631 1 0(US) 

Representative: Lunt, John Cooper et al, 
Monsanto Europe S.A. Patent Department Avenue de 
Tervuren 270-272 Letter Box No 1 
B-1150BrusseIs(BE) 



CO 

< 

o 
w 
is 

o 

cm 

o 

ft, 



© Human preproinsulin-Iike growth factor I. 

© The invention provides DNA sequences encoding a 
human preproinsulin-Iike growth factor I protein and novet 
extension peptides. A novel preproinsulin-Iike growth factor I 
protein and novel extension peptides are also provided. The 
present invention further provides a human IGF-I gene which 
has been sequenced and which encodes at least two pre- 
proinsulin-Iike growth factor-l proteins. Various genes and 
DNA sequences useful in producing essentially pure mature 
IGF-I r preproinsulin-Iike growth factor I proteins and IGF-1 
gene related proteins are also provided. 



Croydon Priming Company Ltd. 



European Patent 
Office 



EUROPEAN SEARCH rIpORT* 



0229750 



Application Number 



EP 87 87 0001 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document with indication, where appropriate, 
of relevant passages 



Relevant 
to claim 



CLASSIFICATION OF THE 
APPLICATION (Int. CI. 4) 



Y,D 



Y,0 



Y,D 



Y,P 



PR0C. NATL. ACAD. SCI. USA, vol. 83, 
January 1986, pages 77-81; P. R0TWEIN: 
"Two insulin-like growth factor I 
messenger RNAs are expressed in human 
1 i ver" 

* Whole article * 
IDEM 

PR0C. NATL. ACAD. SCI. USA, vol. 82, 
October 1985, pages 6450-6454; G.I. 
BELL et al.: "Isolation of the human 
insulin-like growth factor genes: 
Insulin-like growth factor II and 
insulin genes are contiguous" 

* Whole article * 

CHEMICAL ABSTRACTS, vol. 100, 1984, 
page 139, abstract no. 186585f, 
Columbus, Ohio, US; A. ULLRICH et al . : 
"Isolation of the human insulin-like 
growth factor I gene using a single 
synthetic DNA probe", & EMB0 J. 1984, 
3(2), 361-4 

* Abstract * 

NATURE, vol. 306, no. 8, December 1983, 
pages 609-611, Macmillan Journals Ltd; 
M. JANSEN et al . : "Sequence of cDNA 
encoding human insulin-like growth 
factor I precursor" 

* Whole article * 



1-23,31 



24-30 
1-26,31 



1-26,31 



1-26,31 



WO-A-8 600 619 (CHIRON CORP.) 
* Whole document * 



27-30 



The present search report has been drawn up for all claims 



C 12 N 15/00 
C 12 P 21/02 
A 61 K 37/02 



TECHNICAL FIELDS 
SEARCHED Out. CI.4) 



C 12 N 
C 12 P 



Place of search 



THE HAGUE 



Dale of completion of the search 

05-07-1989 



Examiner 



CASTRO Y GONZALEZ R.F. 



c 

1=. 

c 



CATEGORY OF CITED DOCUMENTS 

X : particularly relevant if taken alone 

Y : particularly relevant if combined with another 

document of the same category 
A : technological background 
O : non-vtritien disclosure 
P : intermediate document 



T : theory or principle underlying the invention 
E : earlier patent document, but published on, or 

after the filing date 
D : document cited in the application 
L : document cited for other reasons 

& : member of the same patent family, corresponding 
document 



